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l:lIEARy_SCREENlNG_METHOD 

l££lE£IoHSi_2f_the_Invention 

A genomic DNA "library" is formed by digesting 
genomic DNA from a particular organism with a suitable 
restriction enzyme, joining the genomic DNA fragments to 
vectors and introducing the DNA fragment - containing 
vectors into a population of host cells. Complementary 
DNA (cDNA) is DNA which has been produced by an enzyme 
known as reverse transcriptase which can synthesize a 
complementary strand of DNA (cDNA) using a mRNA strand as 
a template. A cDNA library is formed by joining the cDNA 
fragments to vectors and introducing the cDNA fragment- 
containing vectors into a population of host cells. 
15 In a DNA or cDNA library, the pieces of DNA exist as 

an unordered collection of thousands or millions of 
pieces. To isolate a host cell carrying a specific DNA 
sequence (i.e., a specific DNA clone), the entire library 
must be screened. Radioactively labeled or otherwise 
labeled nucleic acid probes are traditionally employed to 
screen a DNA or cDNA library. Nucleic acid probes 
identify a specific DNA sequence by a process of in vitro 
hybridization between complementary DNA sequences in'thl 
probe and the DNA clone. 
25 A specific DNA clone that has been identified and 

isolated in this manner can contain DNA that is con- 
tiguous to the probe sequence. A terminus of the DNA 
clone, therefore, can be used as a new probe to rescreen 
the same or another DNA library to obtain a second DNA 
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clone which has an overlapping sequence with the first 
DNA clone. By obtaining a set of overlapping DNA clones, 
a physical map of a genomic region on a chromosome may be 
constructed. This process is called "chromosome walking" 
5 because each overlapping DNA clone which is isolated is 
one step further along the chromosome. Each DNA clone 
also can be studied to determine its genetic relationship 
to a previously mapped genetic function and, thus, a 
series of overlapping DNA clones provides a physical map 
20 of 3. chromosome which may be correlated to a map of 
genetic functions . 

Chromosome walking is used, for example, to identify 
or localize a gene of interest, such as one thought to be 
causative of or associated with a disease or other 
15 condition, phenotype or quantitative trait. This is done 
by using a DNA fragment which displays a restriction 
fragment length polymorphism (RFLP) shown to be genetic- 
ally linked to (i.e., physically localized to the same 
chromosome region as) a gene which causes or is 
20 associated with a disease, or other condition, phenotype, 
or quantitative trait or a segment of DNA contiguous to 
such a RFLP or a cDNA, as an in vitro hybridization probe 
to screen a DNA library and pull out larger fragments of 
DNA in which all or part of the probe sequence is repre- 
25 sented. 

The usefulness of any DNA clone isolated in this 
manner is that it includes DNA that is contiguous to the 
RFLP sequence that is incrementally closer to the 
position of the sought-after gene than the original RFLP. 
30 '^^ 5®^ ^ step closer, a labeled molecule corresponding to 
an end of the newly isolated DNA clone is prepared and 
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used to rescreen the library, with the goal being to 
isolate DNA clones that overlap with sequences found in 
the first DNA clone and that are incrementally closer to 
the gene of interest than either the starting probe or 
5 the first DNA clone isolated. This procedure is repeated 
as needed, with the resulting DNA clones being used in 
genetic studies to assess whether they are more closely 
linked to the gene of interest. To walk over a distance 
of 10 million base pairs using presently- available 
10 chromosome walking techniques could require from 100 to 
2,000 steps, depending on the DNA cloning vector system 
used. Any approach designed to decrease the work 
required to take a single walking step or which would 
allow multiple walking projects to be carried out simul- 
taneously would be a major advance. 

The number of DNA clones which would be required to 
form a complete library of genomic DNA is determined by 
the size of the genome and the DNA clone capacity of the 
vector used to clone and propagate the segments of the 
genomic DNA. Construction and screening of genomic DNA 
libraries of organisms with large genomes is labor 
intensive and time consuming. The development of vectors 
having a capacity for large DNA clones has helped to 
reduce the labor involved in screening genomic libraries. 
However, screening libraries remains time consuming and 
labor intensive. 

Summar2_of_the_Invention 

The present invention is a method of identifying and 
isolating a DNA fragment of interest (a target DNA 
fragment), from a DNA fragment library in a eukaryotic 
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host cell, which is based on homologous recombination 
between the target DNA fragment and DNA present in a 
targeting DNA molecule introduced into the DNA fragment 
library. It further relates to targeting vectors and DNA 
fragment libraries constructed in eukaryotic host cells 
as described herein. 

The method of the present invention is used to 
screen a DNA fragment library constructed in a eukaryotic 
host cell in which genetic recombination (exchange of 
information between DNA present in an artificial unit (or 
episome) or in a chromosome in the host cell and DNA 
introduced into the host cell) occurs by means of homo- 
logous recombination. In eukaryotic host cells, DNA 
fragments are propagated in the form of an episome or 
other artificial unit which is replicatable in the 
eukaryotic host cell. The episome or artificial unit 
includes, in addition to the DNA fragment, sequences 
which can be used for propagation in bacteria, one or 
more marker genes for selection in bacteria and one or 
more marker genes for selection in the eukaryotic host 
cells . 

In one embodiment of the present method, in which 
the eukaryotic host cell is yeast, genetic recombination 
occurs essentially exclusively by homologous recombi- - 
nation. DNA. fragments in host cells are propagated in 
the form of artificial chromosomes which include, in 
addition to a DNA fragment insert, all of the DNA 
sequences necessary for the chromosome to participate in 
host cell replication and mitotic segregation in a manner 
similar to that of naturally-present host cell chromo- 
somes. In general, the artificial chromosome is present 
in one copy or low-copy number in a host cell. 
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The present method makes use of a targeting vector 
or vehicle which: 1) includes a DNA sequence, referred 
to as targeting DNA, homologous to at least a portion of 
the target DNA fragment and a selectable marker gene 
which is functional in host cells under appropriate con- 
ditions and 2) is non- replicating in the host cell. 
Targeting DNA can be any DNA sequence, including genomic 
DNA. cDNA and DNA synthesized using known techniques. 
Preferably a double - strand break is made in the targeting 
DNA present in the targeting vector, which generally is 
circular when purified from an coli host. Alterna- 
tively, a gap can be introduced by' jnaking: two cuts in the 
targeting DNA (e.g.., with appropriately selected restric- 
tion en2yme(s)). The break or gap renders the vector 
linear, provides DNA ends which stimulate homologous 
recombination with host cell artificial chromosome 
sequences and increases the efficiency of stable trans- 
formation by homologous recombination. 

The targeting vector is introduced into cells 
harboring the DNA fragment library, producing a mixed 
population of host cells, some of which contain the 
targeting vector and some of which do not. The resulting 
population of host cells is maintained under conditions 
appropriate for homologous recombination between DNA 
already present in the cell (i.e., prior to introduction 
of the targeting vector) and homologous sequences, such 
as those in the targeting vector. Subsequently, the 
population of cells is subjected to conditions appro- 
priate for selection of host cells in which homologous 
recombination has occurred. Because the targeting vector 
is unable to replicate in the host cell, stable trans- 
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formation with the selectable marker gene can occur only 
through homologous recombination. The selectable marker 
gene is replicated and, therefore, confers a stable 
phenotype, only in host cells in which homologous 
5 recombination with sequences that are replicatable in the 
host has occurred. Identification of such host 
cells--and, thus, of host cells containing the target DNA 
fragment of interest- -is carried out by culturing the 
population of host cells under conditions (e.g., 
IQ culturing on appropriate media) in which only those host 
cells in which homologous recombination (and stable 
transformation) occurred can survive. Growth of a 
transformed host cell is indicative of the presence of 
the target DNA fragment. Host cells containing a target 
15 DNA fragment are, as a result, separated or isolated from 
host cells which do not contain the target DNA fragment. 
The target DNA fragment can be removed from the host cell 
and sequenced or manipulated (e.g., subcloned or mapped), 
using known techniques. 
20 Alternatively, targeting DNA and a selectable marker 

gene for selection in yeast can be introduced into yeast 
cells containing the DNA fragment library by mating a 
yeast strain containing the targeting DNA and the select- 
able marker gene on a targeting vehicle which is a 
25 replicating yeast linear plasmid with the yeast host 
cells containing the library. In this embodiment, the 
two yeast strains must be of opposite mating types. 
Homologous recombination occurs between the targeting 
linear plasmid and a library YAC having DNA homologous to 
30 targeting DNA, producing two linear molecules, each of 
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which is a YAC . In one embodiment, the linear plasmid 
has negatively selectable markers flanking the targeting 
DNA sequence. Each of the two recombination products 
carries one of the two negatively selectable markers, 
making differential selection of the two recombination 
products possible. In another embodiment of the method 
in which mating of opposite mating type yeast strains is 
used, a first yeast strain containing a yeast replicating 
plasmid, constructed in such a manner that the targeting 
DNA and a first selectable marker gene can be freed from 
the yeast replicon by recombination events and a second 
selectable marker gene, which is a negatively selectable 
marker gene, is used to select the replicon itself. When 
this strain is mated to all members of a YAC library, the 
freed targeting sequence can undergo recombination with 
YAC molecules within the library. 

The replicating yeast plasmids described above can 
also be introduced into host cells containing YACs by 
transformation. 

In a preferred embodiment, the DNA fragment library 
is constructed in yeast, such as Saccharomyce s (S.) 
cerevisiae or Schizosaccharomyces (S. ) 2£Sbe, in which 
DNA fragments are present in yeast artificial chromosomes 
(YAC). Each yeast host cell contains one YAC or a few 
YACs, each present in one or few copies. A YAC includes, 
in addition to a DNA fragment, all of the DNA sequences 
required for chromosomes to replicate in yeast, segregate 
chromosomes to their progeny and stabilize chromosome 
ends. In this embodiment, the targeting vector used is a 
bacterial plasmid or other vector which does not repli- 
cate in yeast and includes targeting DNA and a selectable 
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marker gene that functions in yeast. The targeting 
vector, which preferably has been linearized by intro- 
ducing a double-strand break within the targeting DNA of 
the bacterial plasmid, is introduced into yeast cells. 
ThB resulting mixed population of yeast cells is main- 
tained under conditions appropriate for homologous 
recombination to occur between targeting DNA and target 
DNA in the YAC. This is followed by selection of yeast 
cells stably transformed with the targeting DNA and 
selectable marker gene. Stable transformation of the 
yeast cells confers on them a selectable phenotype, such 
as antibiotic resistance, nutrient prototrophy (such as 
amino acid prototrophy or nucleoside prototrophy) , 
tolerance to a metal ion, ability to progress through the 
cell cycle or expression of a cell surface marker. 
Growth of yeast cells under conditions compatible with 
survival only of stably transformed cells is indicative 
of the presence of the target DNA sequence. Target DNA 
can be removed from the yeast cell and sequenced or 
manipulated, using known techniques. 

The present invention also relates to targeting DNA 
molecules and vectors useful in the present method. 
Vectors include targeting vectors, such as bacterial 
plasmids which do not replicate in yeast and include 
targeting DNA and a selectable marker gene functional in 
yeast. They may also include a selectable marker gene 
for selection in bacteria. Additional targeting DNA 
molecules include replicating molecules, such as a yeast 
linear plasmid. 

YAC arm vectors useful in the present method are 
also the subject of the present invention. These include 
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a yeast selectable marker gene, a bacterial origin of 
replication, a bacterial selectable marker gene, a yeast 
telomere, and one or more cloning sites at which 
targeting DNA is introduced or inserted into the vector. 
5 In addition, YAC arm vectors can include yeast centromere 
sequences and/or a yeast replication origin. YAC arm 
vectors which are the subject of the present invention 
include those designated pTKENDA, pTKENDA2 , pTKENDB, 
pTKENDC, pTKENDD and their functional equivalents. 
10 The present invention further relates to eukaryotic 

host cells, particularly yeast cells, constructed as 
described herein and useful for construction of YAC 
libraries from which a DNA fragment of interest can be 
identified and isolated by the claimed method. In 
15 addition, the present invention relates to DNA fragment 
libraries, particularly YAC libraries, constructed in 
such eukaryotic host cells. 

In one embodiment, a DNA fragment library is con- 
structed in a yeast host strain carrying a chromosomal 
20 deletion of four selectable marker genes (i.e.. the four 
selectable marker genes normally present in the yeast 
strain genome have been deleted). The yeast strain has 
incorporated into it a pair of YAC arm vectors, each of 
which includes the following elements: a yeast select- 
25 able marker gene which is one of the four selectable 
marker genes deleted from the yeast host strain; a bac- 
terial origin of replication; a bacterial selectable 
marker gene and a yeast telomere. The yeast selectable 
marker gene in each member of the pair of YAC arm vectors 
30 is different from that present in the other member of the 
pair. In one embodiment, the yeast strain carries a 
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chromosomal deletion of the ARG4 gene, the TRPl gene, the 
LEU2 gene and the URA3 gene. A pair of YAC arm vectors 
can include any combination of pairs of these marker 
genes; each member of the pair includes a marker gene 
different from that included in the other member of the 
pair. In one embodiment, in which a two-library system 
is used, the yeast host strain carries a chromosomal 
deletion of four selectable marker genes and two pairs of 
YAC arm vectors are used, each carrying a selectable 
marker gene deleted from the yeast host strain and not 
present in the other member of the pair of YAC arm 
vectors in which it is used. Such yeast host strains and 
YAC arm vectors are described in detail herein. 

The method, targeting vectors, YAC arm vectors and 
DNA fragment libraries of the present invention are 
useful for identifying and isolating a target DNA frag- 
ment, which can be genomic DNA or cDNA and can be an 
entire gene, gene portion or other DNA sequence. The DNA 
in DNA fragment libraries screened by this method can be 
of any type, such as, but not limited to, mammalian 
(particularly human), plant, insect, avian, fish, 
crustacean, molluscan, viral, nematode, amphibian, 
reptilian or protozoan. For example, they can be used to 
identify and isolate a gene associated with a particular 
disease, condition, phenotype, or quantitative trait, 
related genes within an organism's genome, and cDNA. 

Further, as described herein, physically contiguous 
DNA sequences can be identified in a YAC library in yeast 
cells (or other DNA fragment library) and used to con- 
struct a physical chromosome map. That is, the present 
method is useful for chromosome walking. In this embodi- ' 
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ment, a first YAC containing a target DNA fragment is 
isolated, using the claimed homologous recombination- 
based method described herein, and a terminus of the 
fragment is subcloned. In many instances, both termini 
will be subcloned in order to determine the correct 
direction for the walk to proceed. The terminus of the 
first target DNA fragment is then used as the targeting 
DNA present in the targeting vector, which is introduced 
into the YAC library. A second target YAC is isolated, 
which has as part of it the target DNA fragment, which 
partially overlaps the first target DNA in sequence. The 
second terminus is subcloned and us^d as the targeting 
DNA in a targeting vector introduced into the YAC 
library. This results in isolation of a third YAC 
containing a target DNA fragment, which partially over- 
laps the second target DNA fragment in sequence. This 
process results in isolation of a series of YAC con- 
taining target DNA fragments which partially overlap and 
can be repeated as many times as needed to construct the 
physical map sought. Chromosome walking can be carried 
out by the method of the present invention by using DNA 
which displays a restriction fragment length polymorphism 
(or RFLP) , a DNA fragment contiguous to a RFLP or a cDNA. 
as targeting DNA in the targeting vector to screen a YAC 
library, A terminus of target DNA isolated in this 
manner is subcloned or isolated and the resulting 
sequence used to isolate a contiguous DNA fragment. This 
is repeated as often as needed to construct the physical 
map and. optimally, to reach a desired gene with which, 
for example, the RFLP is associated. 
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The method of the subject invention has numerous 
advantages over other approaches to screening DNA 
libraries- For example, it is possible to screen a DNA 
fragment library many times, simultaneously. Libraries 

5 are stored as a pool of clones, thus eliminating the work 
needed to organize and screen a library that is distri- 
buted over many filter membranes. The labor needed to 
screen a library is considerably less than that needed 
with conventional methods. In addition, terminal 

10 sequences are isolated from YAC clones without the need 
for subcloning in a form suitable for subsequent walking 
steps . 

Brief Description of the PgggiTigs 

Figure 1 illustrates the identification of target 

X5 DNA fragments in a YAC library by the homologous - 

recombination selection method of the present invention. 
The YAC includes telomeres (arrowheads), centromere/yeast 
origin of replication (filled circles), and a DNA frag- 
ment; in the case of clone #3, the DNA fragment contains 

20 within it a target DNA fragment (solid rectangle). 

Figure 2 illustrates targeting (homologous recipro- 
cal recombination) to generate a YAC that is marked for 
selection , 

Figure 3 illustrates selection by homologous 
25 recombination of a DNA clone from a DNA YAC library using 
one-step gene disruption* 

Figure 4 illustrates selection of DNA clones by 
homologous recombination using two DNA YAC libraries. 

Figure 5 is a map of plasmid pl84DLARG. B: BamHI ; 
30 Sm: Smal; P: PstI; ARG4: yeast ARG4 gene (arrow 
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indicates direction of transcription); Cm: chlor- 
amphenicol resistance gene; ORI (pACYC184): Origin of 

replication from pACYC184; : hypothetical 

targeting sequence inserted into cloning site. 
5 Figure 6a is a plasmid " map *of pTKENDA. 

Figure 6b is a plasmid map of pTKENDB . 
Figure 6c is a plasmid map of pTKENDC. 
Figure 6d is a plasmid map of pTKENDD. 
Figure 7 is a restriction enzyme and Southern blot 
10 analysis of clones selected by targeting with human 
epsilon- and beta-globin sequences. 

Figure 8a contains oligonucleotides used in the 
construction of YAC arm vectors. The sequences in upper 
case letters indicate bases corresponding to oligonucleo- 
15 tides synthesized in vitro. The sequences in lower case 
letters indicate those bases filled in in vitro using 
each pair of annealed oligonucleotides. Relevant 
restriction enzyme recognition sequences are indicated. 
Figure 8b contains oligonucleotides used in the 
20 construction of YAC arm vectors. The sequences in upper 
case letters indicate bases corresponding to oligo- 
nucleotides synthesized in vitro. The sequences in lower 
case letters indicate those bases filled in in vitro 
using each pair of annealed oligonucleotides. Relevant 
restriction enzyme recognition sequences are indicated. 

Figure 8c contains oligonucleotides used in the 
construction of YAC arm vectors. The underlined base 
indicates the mutation from the wild-type sequence. 

Figure 9 is photograph of a restriction enzyme and 
Southern hybridization analysis of DNA from eight yeast 
colonies isolated by screening with fragment 8A. Lanes 
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1-4: clones 8A.1, 8A.2, 8A . 3 and 8A.4; Lane 5: plasmid 
pl84-8A. Lanes 6-7: clones 8A,5 and 8A.6; Lane 8: an 
example of DNA from an isolated colony which does not 
show the unit-length-linear band; Lane 9: clone 8A.11. 
5. 1 microgram of total yeast DNA was loaded in lanes 1-4 
and 5-9. 2 nanograms of plasmid pl84-8A was loaded in 
lane 5. The electrophoresed DNA samples (all digested 
with Kpnl) were transferred to a nylon membrane and 
hybridized with a 32-P labeled ARG4 DNA probe. The arrow 
10 marks the position of the unit- length - linear band at 8.3 
kb. 

Figure 10 is a photograph of a* restriction enzyme 
and Southern hybridization analysis of DNA from each of 
the positive clones digested with Xhol and with either 

15 Kpnl (for those isolated by screening with fragment 8A) 
or Avail (for those isolated by screening with fragment 
lOB). Samples were electrophoresed on a 1% agarose gel, 
transferred to a nylon filter, and hybridized with ^^P 
labeled pBR328 (Boehringer Mannheim Biochemicals , 

20 Indianapolis, IN). Lanes 1-7: clones 8A-1, 8A.2, 8A.3, 
8A.4, 8A.5, 8A.6, 8A.11 (all isolated by screening with 
fragment 8A) ; lanes 8-10: clones 10B.6, 10B.29, 10B.41 
(isolated by screening with fragment lOB) . 

Figure 11 shows analysis of YAC DNA for presence of 

25 unit- length-linear fragments hybridizing to an ARG4 DNA 
probe: Lane 1: EcoNI digest of plasmid 

pl84DLARG/PCRF.5, which contains the 852 base pair PstI 
fragment from the human ADA locus cloned into the PstI 
site of pl84DLARG. 1 nanogram of digested plasmid DNA 
30 was loaded; Lanes 2-3: empty (no samples loaded); Lanes 
3-6: EcoNI digested YAC DNA (approximately 1 microgram) 
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from candidate tr ansf ormant s 184ADA.B, 184ADA.C, and 
184ADA,D. The electrophoresed samples were transferred 
to a nylon membrane and hybridized to a 32-P labeled 
fragment of ARG4 DNA. The arrow indicates the position 

5 of EcoNI linearized plasmid pl84DLARG/PCRF, 5 (5.2 kb) . 
Figure 12 is a schematic representation of one 
embodiment of the present homologous recombination 
method, in which a YAC containing target DNA is 
identified using recombination with a linear yeast 

10 plasmid. 



^Ii^f-5£l£llEtion_of_ATCC_De£osits 

The following deposits have been made at the 
American Type Culture Collection (June 28. 1990) under 
the accession numbers indicated. These deposits have 
15 been made under the terms of the Budapest Treaty and all 
restrictions upon their availability will be removed upon 
granting of a United States patent . 



1, Saccharomyces cerevisiae strain TD7-16d, ATCC 
No. 74010. 

20 2. Plasmid pl84DLARG, ATCC No. 40832. 

3. Plasmid pTKENDA. ATCC No. 40833. 



^£iSli£^_5£scri£tion_of_the_Inventi^ 

The present invention is based upon Applicant's 
discovery that the process of homologous recombination 
25 which occurs in eukaryotic cells can be used for the 

purpose of screening DNA fragment libraries constructed 
in eukaryotic cells and identifying and isolating a DNA 
fragment of interest, referred to as a target DNA frag- 
ment . 
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The present invention is a method of isolating a DNA 
fragment of interest, referred to as a target DNA frag- 
ment, from a DNA library constructed in a eukaryotic host 
in which genetic recombination occurs by homologous 
recombination. The target DNA fragment is generally 
present in a larger fragment contained in the eukaryotic 
host cell. The DNA used to construct the DNA libraries 
may be cDNA or genomic DNA which is of human or other 
origin, including that of plants and other mammals. A 
target DNA fragment is identified by the present method 
by introducing into the DNA fragment library a non* 
replicating targeting vehicle which contains targeting 
DNA and an appropriate selectable marker gene and identi- 
fying eukaryotic host cells in which homologous recombi- 
nation occurs between target DNA and targeting DNA, 
Homologous recombination results in stable integration of 
targeting DNA and the selectable marker gene into DNA in 
host cells, which are identified on the basis of a 
selectable phenotype conferred as a result of stable 
transformation of host cells with the selectable marker 
gene. For example, they are identified on the basis of 
their ability to grow under conditions (e.g., in the 
presence of a drug or metal ion or in the absence of an 
essential nutrient) incompatible with growth of host 
cells in which stable integration has not occurred. 

The DNA library used in the present method is a 
population of eukaryotic host cells, such as yeast celTs', 
containing a unit, such as an artificial chromosome, - " 
which includes a DNA fragment insert and is replicated in 
the host cells. The DNA library is screened for DNA 
fragment insert (s) , present in the artificial chromosome. 
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all or a portion of which is a target DNA fragment, by 
introducing into the eukaryotic host cells a targeting 
vehicle, such as a bacterial plasmid, which is non- 
replicating in the eukaryotic host cells and includes a 
targeting DNA sequence (i.e., a DNA sequence homologous, ^ 
at least in part, to the target DNA) and a selectable 
marker gene useful for selection in the host cell. Host 
cells containing the targeting vehicle are cultured under 
conditions appropriate for homologous recombination 
between the targeting DNA sequence and target DNA to 
occur. Host cells stably transformed with the selectable 
marker are subsequently iden t if ied . ( i . e . ; by identifying 
host cells able to grow under conditions under which 
non-stably transformed cells cannot grow, and die). 

In general, the targeting vehicle is nonreplicating 
in the host cell, such as a bacterial plasmid, and 
includes the targeting DNA sequence and a selectable 
marker gene- for selection in the host cell. However, in 
certain embodiments, such as those in which the host cell 
is yeast, the targeting vehicle may be replicating 
vehicle, such as a yeast linear plasmid, which includes 
marker genes for selection in yeast and targeting DNA. 

In a specific embodiment of the present invention, 
which is exemplified by the Examples which follow, the 
DNA library is a population of yeast cells which contain 
artificial chromosomes carrying a DNA fragment insert and 
host cells containing target DNA are identified and - 
isolated from this YAC vector library. 

A targeting vehicle, such as a bacterial plasmid, 
which is non-replicating in yeast is introduced into the^" 
population of host yeast cells containing the DNA YAC 
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library. The bacterial plasmid includes a targeting DNA 
sequence which is homologous, at least in part, to target 
DNA of interest and a selectable marker gene that 
functions in yeast. Preferably, the targeting plasmid is 
cut with a restriction endonuclease that introduces a 
double-strand break within the targeting DNA sequence, 
thereby linearizing the bacterial plasmid and providing 
DNA ends which are recombinogenic , to stimulate the 
process of hamologous recombination with the YAC 
sequences. The efficiency of homologous recombination 
is, as a result, increased. Because the plasmid is 
non-replicating in yeast, stable transformation with the 
selectable marker can only proceed by integration into 
natural or artificial yeast chromosomes , 

The resulting host yeast cell population, which 
includes stably transformed host yeast cells (i.e., those 
in which the plasmid, including the selectable marker 
gene, has been stably integrated by homologous recombi- 
nation into DNA already present in host cells prior to 
introduction of the targeting vehicle) and non-stably 
transformed host yeast cells, is cultured under con- 
ditions such that only stably transformed yeast cells are 
able to grow. In a correctly targeted event, the entire 
plasmid is stably incorporated in the host yeast cells by 
homologous recombination between the targeting DNA 
sequence of the plasmid and homologous sequences (i.e., 
target DNA fragments) in the YAC. In other embodiments, 
such as that in which a linear targeting molecule is 
used, it is not necessary, however, for the entire 
plasmid to become stably incorporated, as long as homo- 
logous recombination occurs to an extent sufficient to. 
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introduce a selectable marker gene into DNA already 
present in the host cell, such as in a YAC. Only those 
few host yeast cells which contain a target DNA frag- 
iDent(s) and have thereby undergone homologous recombi- 
5 nation with the targeting plasmid are able to grow under 
the conditions used (e.g., in antibiotic-containing 
medium or medium lacking a nutrient essential to non- 
stably transformed cells), due to the introduction of the 
yeast-selectable marker gene contained on the targeting 
10 plasmid. They are identified on the basis of the 

selectable phenotype conferred by stable transformation 
of the selectable marker gene. 

To prevent homologous recombination events between 
the plasmid-borne yeast - selectable marker gene and 
homologous sequences in the host yeast cells, it is 
preferable that host cell sequences homologous with 
targeting vector sequences have been deleted or almost 
entirely deleted from the genome of the host yeast strain 
before it is used for the YAC vector library. Alter- 
natively, host cell sequences homologous with a yeast- 
selectable marker gene on the incoming targeting plasmid 
can be retained as a mutated, non - f unc t ional portion of 
the yeast chromosome. If this approach is used, however, 
more positive scores for homologous recombination will 
have to be screened to ensure that homologous recombi- 
nation events which occur took place between the tar- 
geting DNA sequence on the bacterial plasmid and the 
target DNA sequence present on the YAC. 

Figure 1 illustrates schematically the isolation of. 
target DNA fragments from a YAC vector library by the 
method of the present invention. The targeting plasmid 
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on the far left is introduced into a population of yeast 
cells (ovals), each of which contains a DNA YAC con- 
taining a different DNA fragment. The plasmid includes a 
selectable marker gene for selection in yeast (diagonally 

5 lined section) and a targeting DNA fragment (solid sec- 
tion) in which a double strand break has been introduced. 
In this example, one host yeast cell (#3) contains a DNA 
fragment in a YAC that is homologous to a sequence 
carried on the targeting plasmid (solid sections on clone 

10 Tir3). Recombination between these two sequence occurs, 
resulting in the stable integration of the selectable 
marker carried on the plasmid into *the yeast chromosome 
(YAC) . The resulting population of cells is grown under 
conditions appropriate for selection of host yeast cells 

15 stably transformed with the selectable marker gene. For 
example , they are plated on appropriate selec tive media , 
such as nutrient deficient media. Only those cells in 
which the selectable marker gene functions grow. Growth 
of cells under these conditions is indicative of the 

20 presence of a target DNA fragment. Although YAC are 
exemplified herein, other yeast vectors , such as YCp 
vectors (YCp50, YCpl9) can be used to construct a DNA 
library. 

The general scheme for selection of a target DNA 
25 fragment from a DNA YAC library is shown in Figure 2. 
Figure 2 illustrates the integration of a targeting 
plasmid (pl84DLARG) carrying a selectable marker (the 
yeast ARG4 gene; open box) and a segment of DNA that is 
homologous to a sequence in the DNA YAC library 
30 (targeting DNA; solid arcs on plasmid). The thin lines- 
represent an insert of human or other (non-yeast) DNA 
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propagated as a yeast artificial chromosome (YAC) . The 
solid black box is the target DNA fragment, a sequence of 
DNA which is a portion of YAC DNA present in a DNA clone, 
found in the library, that is homologous to the targeting 
sequence. The remaining portions of the DNA YAC are 
comprised of the YAC vector arms: the thick lines 
represent plasmid vector sequences for replication and 
selection in bacteria. The shaded boxes represent 
genetic markers used for selection in yeast (yeast 
selectable markers URA3 and TRPl). The solid arrowheads 
and circle represent telomeres (TEL) and a centromere/ 
yeast replication origin (CEN/ARS), respectively. Figure 
2a depicts the targeting DNA (present in the targeting 
vector) aligning with the target DNA fragment in the YAC. 
Figure 2b depicts the product of homologous recombination 
between the targeting DNA and target DNA fragment. The 
targeting plasmid has been cut uniquely in the targeting 
DNA, at the site corresponding to the vertical arrow in 
the target sequence. ULL indicates the unit length 
linear restriction fragment that results from duplication 
of the target sequence (and the restriction site) on the 
YAC. As described in Example I, a ULL can be generated 
only if integration occurs into a DNA sequence that 
contains the restriction enzyme site in question and 
contains sufficient homology surrounding that site to 
allow resynthesis (by repair) of the restriction enzyme 
site on the targeting plasmid. Candidate clones that 
display a ULL are assumed to be homologous recombination 
events and are analyzed further. 

In another embodiment of this method, a yeast- 
selectable marker gene on the incoming targeting DNA 
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molecule can be a bacterial gene, engineered to be 
expressed in yeast, which confers drug resistance to 
yeast cells, e.g., the CAT or neo genes from Tn9 and 
Tn903,. or bacterial amino acid or nucleoside prototrophy 
5 genes ^ e.g-i the ££H argH, trpC, and pyrF genes. 

In another embodiment of the method of the present 
invention, the targeting vector is a linear DNA fragment 
which includes a targeting DNA sequence homologous to a 
target DNA fragment to be identified and/or isolated from 
10 the YAG library. In this embodiment, a selectable marker 
gene is inserted into the targeting DNA, producing a 
targeting DNA sequence which includes two non- contiguous 
domains. This embodiment is described in detail in 
Example II and represented schematically in Figure 3. 
15 The targeting vector, which is a linear sequence which 
does not replicate in yeast, is transformed into the 
pooled DNA YAC library, as described in Example I. 
Homologous recombination occurs between the targeting DNA 
and the target DNA fragment. 
20 In addition to the above- described embodiment, other 

approaches to introducing targeting DNA into host cells 
can be used. For example, targeting DNA can be present 
on a replicating yeast linear plasmid (Murray, A.W. and 
Szostak, J.W., Nature 305:189-193 (1983)) in a yeast 
25 strain of mating type opposite to that of the host strain 
used for the library. The linear plasmid has selectable 
markers flanking the targeting DNA sequence (i.e., one at 
each end of the targeting DNA); both markers are 
different from those used in the construction of the YAC 
30 library and can be selected against (i.e., negatively 
selectable markers, such as LYS2, URA3 or CYH2) . 
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Homologous recombination between two linear molecules 
produces two linear molecules, each of which is a hybrid 
of the two parental molecules. In this embodiment, in 
which recombination occurs between the targeting linear 
5 plasmid and a library YAC, each of the two recombination 
products is a YAC and each carries one of the two 
negatively selectable markers, allowing for differential 
selection of the two recombination products. 

The basis of this differential selection is illus- 
10 trated in Figure 12. Filled circles . , arrowheads , and 
open rectangles represent centromeric, telomeric, and 
marker gene sequences, respectively. The shaded boxes 
represent targeting or target sequences. URAS"*" cells can 
be selected against (killed) by growth on media 
containing the nucleoside analog 5 - fluoro- orotic acid 
(5F0A), while LYS2'*" cells can be selected against by 
growth on media containing the amino acid analog alpha- 
amino-adipic acid (aaa) . Molecule 1 is a target YAC 
constructed in a vector system using ARG4 and TRPl as 
20 selectable markers (phenotype arg*^ trp"*" SFOA^aaa^) . 
Molecule 2 is a linear targeting plasmid in which the 
targeting sequence is flanked by URA3 and LYS2 (phenotype 
arg trp" 5F0A aaa^) . The phenotype of cells harboring 
molecules 1 and 2 in an unrecombined form is arg"*" trp"*" 
25 5F0A aaa^. Molecules 3 and 4 are the products of 

recombination between Molecules 1 and 2 , resulting from a 
cross-over between the targeting and target sequence. 
The phenotype of Molecule 3 is arg"^ trp" 5F0A^ oaa^ . and 
can be selected for by growth on 5F0A plates lacking 
arginine. The phenotype of Molecule 4 is arg" trp"*" 5F0A^ 
aaa , and can be selected for by growth on aaa plates 



30 



wo 93/03183 



PCr/US91/08679 



.24- 



lacking tryptophan. Thus, cells containing one or both 
non-recombinant molecules, as well as cells containing 
either of the recombinant products can be differentially 
selected (cells harboring only one or the other - recombi - 
nant product arise by random loss events). 

In such a scheme, the yeast cells harboring the 
targeting linear plasmids are mated to all members of the 
library and maintained under conditions favorable for 
spontaneous or induced homologous recombination (induced 
by, for example, meiosis or ultraviolet irradiation). 
Recombinant target YACs are selected by virtue of the 
unique phenotypes of the recombination products resulting 
from homologous recombination between the targeting 
sequence on the linear plasmid and YAC molecules 
harboring a suitable target sequence. Each of the two 
product YACs is truncated at the position of the target 
DNA sequence, and the differential selection is used to 
isolate the two products separately. In order to isolate 
the two products of the single event, yeast cells 
harborings YACs and linear targeting plasmids are prefer- 
ably plated or gridded out prior to selection for 
recombinants. Selection is accomplished by replica 
plating onto the appropriate selective plates. 

In this embodiment, the relative orientation of the 
targeting sequence with respect to the two (negatively) 
selectable markers on the linear targeting plasmid is 
important. Recombination between a target YAC and only 
one of the two orientations of targeting linear plasmid 
will give rise to a stable recombinant (i.e., a recombi- 
nant with one and only one centromere) . YAC molecules 
with two centromeres show frequent breakage and unstable 
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phenotypes; YAC molecules with no centromere are highly 
unstable by virtue of segregation bias. In one embodi- 
ment, linear targeting plasmids are constructed with the 
targeting sequence present in both orientations and 
introduced into the library in separate matings. 

As an alternative to mating to introduce linear 
targeting plasmids into the library, linear targeting 
plasmids can be introduced into host cells containing 
YACs by transformation, essentially as described in 
Example I . 

In another embodiment, a yeast replicating plasmid 
carrying a targeting sequence can be constructed in such 
a manner that the targeting DNA and a selectable marker 
(SMI) can be freed from the yeast replicon by natural or 
induced recombination events, and such that the replicon 
itself can be selected against by virtue of a negatively 
selectable marker (SM2), such as URA3 , LYS2 or CYH2. 
Examples of inducible recombination systems which can be 
engineered to function for this purpose are the fl.£ 
mediated recombination pathway of the yeast 2-micron 
plasmid and the cre^^lox recombination system of bacterio- 
phage PI. The plasmid is introduced into a yeast strain 
of mating type opposite to that of the host strain used 
for the library. After mating to all members of the YAC 
library, the targeting DNA sequence and selectable marker 
are released as a non- repl icating molecule and the 
selectable marker can only be stabilized by homologous 
recombination with a YAC harboring a suitable target DNA 
sequence. The targeted recombinants are selected by 
^plating onto media which selects for SMI and against SM2 . 
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As an alternative to mating to introduce the plasmid 
described in the preceding paragraph, the plasmid can be 
introduced by transformation, essentially as described in 
Example I, followed by the induction step to free the 
5 targeting substrate from the yeast replicon. 

Ident if ic ation and Igc^lgtion of a Target DNA Fragment 
Usin fi Homologous^Re comb ina t ion 

The above-described embodiments of the present 
method are useful to identify and isolate any target DNA 

jQ fragment, which can be an entire gene, a gene portion or 
other nucleotide sequence. For exalnple, a gene of 
interest, such as a ^-globin gene or adenosine deaminase 
gene^ can be identified in a DNA fragment library using 
the claimed method and, if desired, isolated from host 

j2 cells by known methods. Identification of target DNA 

fragments by the present method is described in detail in 
Examples I, V and VI. 

Hoinglogous "Recombination__Chroroos 

The method of the present invention, by which a 

20 target DNA fragment is isolated from a DNA library, is 
useful for isolating physically-contiguous DNA segments 
from a DNA YAC library in order to construct a physical 
chromosome map. That is, when used iteratively, each 
time with targeting DNA derived from a YAC which overlaps 

25 with and extends beyond a previously identified region, 
it is a method for chromosome walking. In the present 
method of chromosome walking, a target DNA fragment 
present in a YAC is isolated, as described above. A 
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terminus of this first target YAC fragment is subcloned 
into a plasmid vector. The terminus of the first DNA 
fragment is, thus, used as a second targeting DNA 
sequence, which is introduced into host yeast cells 
containing a DNA YAC library. The terminus of the first' 
DNA fragment, which is contiguous to the first target DNA 
sequence, in turn becomes the second targeting DNA 
sequence. As used herein, the term contiguous includes 
sequences which are immediately adjacent to the first 
target sequence and those nearby or in proximity to the' 
first target sequence (i.e., separated from the first 
target sequence by intervening nucleotide (s )) . This 
second targeting DNA sequence should not have any 
homology with the first targeting DNA sequence, so that 
when it in turn is incorporated in a YAC at a point of 
homology with a second DNA clone, the second DNA clone 
selected will have a different terminal DNA sequence. 
The terminal subfragment from the second DNA clone is 
used to isolate the next (i.e., the third) DNA clone. 
Each successive DNA clone is isolated by virtue of its 
homology with the terminal subfragment of the previously 
isolated DNA clone-. A series of overlapping clones is 
obtained by repeating this process; the process is 
repeated as needed to construct the physical map desired. 
The successive recovery of terminal DNA fragments allows 
rescreening the same library or a second library for 
overlapping clones. 

In one embodiment of the present invention, chromo- 
some walking is carried out in order to determine the 
chromosomal location of a gene of interest, such as a 
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gene which causes a disease, by using a DNA fragment 
displaying a RFLP genetically linked to the gene of 
interest, or a fragment contiguous with the RFLP, as 
targeting DNA in the targeting vector. A targeting 
5 vector, such as a bacterial plasmid, which includes the 
RFLP-displaying DNA, or fragment contiguous to the RFLP 
displaying DNA, or cDNA as targeting DNA and a selectable 
marker gene, is introduced into a human DNA YAC library. 
Homologous recombination between the targeting DNA and a 
10 target DNA fragment in the library results in the first 
step in walking to the gene of interest. A YAC con- 
taining the target DNA fragment is identified in this 
way. One terminus or both termini of the target DNA 
fragment is used as targeting DNA in a targeting vector 
15 to rescreen the same library or screen a second library, 
as described above. Also as described above, this is 
repeated, each time using a terminus of the target DNA 
fragment isolated in the previous step as targeting DNA. 
This continues until the gene of interest is identified 
20 or the desired physical map is completed. 

In another embodiment of the present method of 
homologous-recombination chromosome walking, the terminal 
fragments from the DNA YAC inserts can be isolated by a 
plasmid-rescue technique. This embodiment is described 
25 in detail in Example III and represented schematically in 
Figure 4» In this case, the YAC vectors are designed 
such that the YAC vector arm contiguous to the DNA 
fragment (clone) insert terminus contains sequences which 
allow for plasmid replication and selection in a 
30 bacterial host. Restriction enzyme digestion of the 

selected YAC DNA clone produces a fragment with one end 
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lying within the terminus of the DNA clone sequence and 
extending into the YAC vector arm. This fragment con- 
tains the bacterial plasmid sequences which are 
essential for replication and selection in coll . 
covalently linked to a fragment of DNA from the terminus 
of the selected YAC DNA clone. Plasmid rescue involves 
restriction enzyme digestion of the total yeast DNA from 
the selected yeast clone; ligation of the digested yeast 
DNA to form monomer circles; and transformation of this 
ligated DNA mixture, into E_. c o 1 i , with the selection for' 
the marker gene in £ o 1 i . 

For use in conjunction with the plasmid rescue 
technique, one can design two different DNA YAC 
libraries. Each library will utilize a different pair of 
15 selectable markers. A set of four YAC arms are designed 
containing appropriate selectable markers for the two 
different libraries. Each YAC arm contains a yeast- 
selectable marker that would be appropriate for the 
selection of host yeast cells of the other library. In 
Figure A, the yeas t- selectable markers in Library 1 are 
ARG4 and TRPl and in Library 2 they are LEU2 and URA3 . 

Total yeast DNA from cells containing the first 
targeted DNA YAC clone are digested with a restriction 
endonuclease that separates the sequence conferring 
replication and stability function in yeast from the 
region of the YAC cloning vector that allows selection 
and propagation in bacteria and a selectable marker that 
functions in yeast (step 3 in Figure 4). This region 
remains covalently attached to sequences containing the 
first targeted DNA fragment terminus.^ This fragment of 
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the YAC DNA clone terminus contains sequences necessary 
for replication in bacteria, a selectable marker for 
selection in bacteria, and a selectable marker for 
selection in yeast, along with the first targeted DNA 
clone terminus sequence. This fragment is circularized 
and amplified in bacteria (step 4 in Figure 4). This 
product then becomes the targeting plasmid with which to 
transform the second DNA library, after introducing a 
double-strand break within the sequence corresponding to 
the DNA clone terminus (i.e., within the trageting DNA 
sequence) (steps 5 and 6 in Figure 4) . The two DNA YAC 
libraries. Library 1 and Library 2, are constructed so 
that the arms in each are stabilized by a different 
vector sequence^ with each arm having a unique selectable 
marker for selection in yeast and a unique selectable 
marker for selection in bacteria. 

The rescue of DNA clone termini described in Example 
III utilize-s restriction endonucleases to cleave a DNA 
clone in such a manner that the terminus is covalently 
attached to a fragment of the YAC vector arm. One of 
ordinary skill in the art will know how to isolate DNA 
clone termini by use of various embodiments of the 
polymerase chain reaction (PCR) (for example, inverse PCR 
or anchored PCR) with such reaction using at least one 
unique primer that anneals to the YAC vector arm 
immediately adjacent to the DNA cloning site, such that 
the first strand synthesis proceeds away from the YAC 
vector arm and copies cloned DNA, and in which specific 
restriction enzyme cleavage sites comprise part of one or 
both of the PCR primers which would facilitate the 
subcloning of terminal fragments from DNA YACs. 
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Even in the absence of homologous recombination 
screening, a two library system is particularly useful 
for chromosome walking. Two key characteristics of such 
a two library system are that: 1) among the total of 
four arms which must be present in the final two - • 
libraries, no arm shares the same marker for plasmid 
selection in coli and 2) there is limited or no 
homology between the bacterial plasmid replicons used in 
the two different libraries. 

In this system, the two unique terminal sequences 
from clones isolated by plasmid rescue (see Example III) 
from the first YAC library (Library 1) can be isolated 
independently simply by plating on different selective 
media plates. Since the isolated plasmids harboring the 
terminal sequences have limited or no homology to either 
vector arm present in the second YAC library (Library 2), 
these plasmids can be used in traditional filter hybri- 
dization screening without subcloning the terminal 
sequences from the plasmid. The plasmids rescued in E^ 
coli can be purified and labeled (e.g., by nick- 
translation or random hexamer priming), and used directly 
to screen a second library. YAC clones isolated from 
Library 2, themselves isolated by screening with intact 
rescued plasmids carrying terminal sequences from YAC 
clones isolated from Library 1, represent steps taken in 
a chromosome walk. Each walking step thus proceeds by 
using labeled plasmids derived from the ends of YAC 
molecules isolated from one of the two libraries to 
directly screen the other, complementary, library. This ' 
method greatly improves the efficiency of traditional 
filter screening techniques by providing a rapid method 
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for independent isolation of each of the two YAC termini 
by differential selection in forms suitable for direct 
labeling and library screening. It eliminates the need 
to subclone or otherwise purify terminal fragments for 
the purpose of labeling and screening for overlapping YAC 
clones . 

The design of the YAC vector arms and the restrict- 
ion enzymes used for plasmid rescue should be such that 
the yeast selectable marker (as well as the centromeric, 
telomeric, and yeast replication sequences) is separated 
from the rescued plasmid sequences and the YAC clone 
terminus. This eliminates the need to use different 
yeast selectable markers in the construction of Libraries 
1 and 2, and to construct a host yeast strain with 
complete deletions of the selectable markers used to 
select for YAC clones in Libraries 1 and 2. Unique 
selectable markers for each of the four arms, which make 
plasmid selection in coli possible, can be, for 
example, a gene encoding resistance to an antibiotic, 
such as chloramphenicol, kanamycin, ampicillin, tetra- 
cycline, spectinomycin, streptomycin, or erythromycin, or 
a gene encoding a biosynthetic marker for which a suit- 
able auxotrophic host exists. 

Bacterial replicons which can be used in order to 
limit the homology between those in the two libraries 
are, for example, pl5A, ColEl, phage M13, phage f 1 , phage 
Lambda and their equivalents. 

Host Cel l Types an d Characteristics 

The method is described herein with particular 
reference to screening YAC DNA libraries constructed in 
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yeast cells through the use of targeting DNA sequences 
present in bacterial plasmids. It is to be understood, 
however, that this is merely for purposes of exempli- 
fication and that the present method can be carried out 
5 as^ng other host cell types, provided that genetic re-, 
combination betveen vector-borne DNA and DNA already 
present in the host cell occurs by homologous recom- 
bination and that an appropriate non-replicating 
targeting vector is available. 
10 Appropriate eukaryotic host cells include those 

which normally (as they occur in nature) undergo genetic 
recombination essentially exclusively by homologous 
recombination (e.g., Sacchar omjce s cer evis iae , 
Schizosaccharom^ces Eombe) . As used herein, the term 
15 essentially exclusively means that homologous recombi- 
nation occurs without significant levels of non- 
homologous recombination under the conditions used. 

Homologous-recombination selection of DNA clones 
could be utilized as a selection method in the cells of 
any organism in which 1) a suitable DNA cloning system 
exists and 2) the cells can be manipulated or induced by 
genetic engineering or genetic manipulation to perform 
recombination which is predominantly based on DNA 
sequence homology, or in which the targeting DNA can be 
treated in such a manner that it engages in homologous- 
recombination as its preferred mode of recombination. 
With these criteria met. one skilled in the recombinant 
DNA arts could perform homologous - recombination selection 
of DNA clones from a DNA library. Such organisms may 
include, but are not limited to, Schizosaccharomx-ce s 
£ombe , Dr OS o£h i 1 a me 1 ano^as ter , Homo s agi en s , Mus 
5Hl£iilHS and S2odo£tera f rugigerdea . 
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Saccharo myces cerevisia e is a preferred host 
organism for the selection of DNA clones using homo- 
logous-recombination because of its ability to route 
transforming DNA carrying double - s trand breaks into a 
recombination pathway based virtually exclusively on DNA 
sequence homology. 

Certain characteristics of host cells in which DNA 
fragment libraries are constructed should be considered 
and possibly modified to optimize use of such cells in 
the present method, such as by decreasing non-targeted 
events and, thus, increasing the efficiency of the 
method. For example^ as described below, it might be 
necessary to remove selectable markers present in the 
targeting vector from host yeast cells and to construct 
targeting vectors in such a manner that they include no 
sequences homologous with those in the vector sequences 
used in the propagation of the DNA library. 

As described below, it has been determined that the 
selectable marker gene(s) chosen for the targeting vector 
should not normally be present in the host yeast genome 
or should be deleted from normal chromosomal position(s) 
in the host yeast strain. Without this modification of 
the host strain, recombination events between the select- 
able marker and the yeast genome would occur at a higher 
rate. For near-complete (> 99%) coverage of the human 
genome, a DNA YAC library with an average fragment size 
of 300 kb would consist of approximately 50,000 membeirs 
(Maniatis, T. et al.. Molecular, Cloning -A_ Laboratory 
Manu al , p^ 271, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York, 1982). In order to isolate ~- 
sequences that are represented only once in such a 
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library, the ratio of targeted to non-targeted events 
should approximate or exceed 50,000 to 1, for at this 
ratio one incorrect (non- targe ted) clone will be isolated 
for every correct clone. In most cases, however, 

5 sequences in this library will be represented 3-5 times, 
and a ratio of 10,000-17,000 to 1 would be adequate. 
Non-targeted events result from recombination between the 
targeting plasmid and regions of homology in the yeast 
genome, and can be minimized by decreasing the extent of 

10 such homology. As described in Example VII, it was 
determined that deleting the chromosomal copies of 
selectable marker genes present on the vectors used is 
desirable because it reduces the occurrence of non- 
targeted events. As described in Example VII, these 

15 results indicate that the selection of a targeted clone 
(target DNA fragment) from a DNA YAC library is feasible 
and particularly efficient in host yeast cells that carry 
rio homology with selectable markers present on targeting 
vectors , 

Non-targeted events might also occur as a result of 
homologous recombination between sequences on the 
targeting vector, such as the bacterial plasmid origin of 
replication or drug resistance marker, and homologous 
sequences on the YAC vector arms used to construct the 
DNA library. This homology can be minimized by con- 
structing the targeting vector using a drug resistance 
marker that is not present in the YAC vector, and by 
using a bacterial plasmid origin of replication that is 
divergent from or non-homologous to the origin present on 
the YAC vector arms. The results described in Example 
VII also place the frequency of non-homologous recom- 
bination at approximately 0.003% (1 in 30,729), 
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consistent with the invention described in this appli- 
cation. It was possible to select yeast cells carrying 
homology to the targeting vector even when only 1 in 
10,000 of the cells transformed had such homology. In 
fact, at this dilution targeted events were isolated 
multiple (four) times, indicating that a clone repre- 
sented once in a library of 40,000 clones could be iso- 
lated. 

Targeting Vectors 

Targeting vectors or vehicles useful in the method 
described herein are also the subject of the present 
invention. One type of targeting vector of the present 
invention has two key characteristics: the vectors are 
non-replicating in the host cell in which the DKA frag- 
ment library is constructed and include a DNA sequence, 
referred to as targeting DNA, which is homologous at 
least in part to a target DNA fragment which, for the 
purposes of the invention, is a DNA fragment comprising 
all or a portion of a desired clone to be identified in 
and isolated from the DNA library. Targeting vectors 
will generally be bacterial plasmids of the Yip class, 
particularly in those cases in which yeast cell hosts are 
used. Vectors appropriate for other types of cell hosts 
can also be constructed using known techniques. 

Sequences used as targeting DNA in the targeting 
vector can be entirely homologous to the target DNA 
fragment, although they need not be. They need be only 
sufficiently homologous that under the conditions used, 
genetic recombination between vector-borne DNA introduced 
into the cells and DNA in YAC in the cells occurs by the 
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host cell recombination pathway or process. Preferably, 
a double strand break or gap is introduced into a tar- 
geting DNA sequence. The free ends adjacent to the break 
or gap can be modified to prevent recircularization 
(e.g., by .phosphatase treatment of the ends of the DNA, 
by creating non-complementary ends by using two different 
restriction enzymes or by removing nucleotides from one 
strand of the DNA, producing a single stranded tail). A 
survey of the literature reveals that s ingle - stranded 
(3') overhangs are intermediates in genetic recombination 
in yeast and other species (Sun, H. et al^ Cell, 64:1155- 
1161 (1991); Maryon, E. and Carroll, D. Mol^_Cell_^_Biol^ 
11:3268-3277). It is reasonable to expect that the use 
of DNA modifying enzymes that degrade one strand of a DNA 
duplex (such as the strand with 5'-»3' polarity) on one or 
both sides of a double - s trand break in this case, 
resulting in molecules with single stranded 3' overhangs 
on one or both sides of a double - strand break or gap) nay 
be useful in producing substrates that have enhanced 
ability to function as targeting molecules in homologous 
recombination library screening. 

In addition to targeting DNA, targeting vectors 
include a selectable marker gene that functions in yeast, 
an origin of replication and a selectable marker that, 
functions in bacteria (e.g., coli.). The selectable 
marker gene is one which is functional (makes selection 
of transformed cells possible) in the host cell type uTed' " 
for DNA.tragment library construction. The choice of the 
yeast selectable marker gene can be made from among many 
various endogenous yeast gene loci, e.g., ARG4 , LEU2 , 
HIS3. HIS4, THRl. URA3. TRPl, LYS2. A-DE2 , ADE8 .' and MET2 . 
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Alternatively , the yeast selectable marker may be a 
marker gene that is not endogenous to the yeast genome, 
but is a foreign gene that confers a selectable pheno- 
type, e.g., a bacterial gene engineered to be expressed 
in yeast and confer drug resistance on the yeast cells 
(such as the CAT or neo genes from transposons Tn9 and 
Tn903, respectively) or nutrient prototrophy, such as 
amino acid or nucleoside prototrophy (such as E, coli 
argH , trpC, or pyrF genes). Other selectable marker 
genes useful for this purpose include genes which confer 
tolerance to metal ions (e.g., the CUPl gene, which 
confers resistance to copper ions), genes which confer an 
ability to progress through the cell cycle on cells with 
a mutant phenotype and genes which result in expression 
of a cell surface marker. 

The suitable selectable marker genes for selection 
in bacteria include the genes encoding resistance to the 
antibiotics , chloramphenicol, kanamycin, ampicillin, 
tetracycline , spec tinomycin ^ s treptomycin, erythromycin, 
or any other marker, including genes encoding bio- 
synthetic enzymes for which auxotrophic bacterial hosts 
exist . 

Bacterial origins of replication may be derived from 
a variety of sources, including pl5A (exemplified by the 
origin of plasmid pACYC184) , ColEl, phage M13, phage fl, 
phage Lambda, or any other replicon that one trained in 
the art would recognize as providing an equivalent 
function. 

Vectors constructed and used to screen YAC DNA 
libraries are described in detail in Example III and 
represented schematically in Figures 5 and 6a-6d. 
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Targeting plasmid pl84DLARG contains a selectable marker 
functional in yeast (ARG4) and a bacterial origin of 
replication (derived from pACYC184) . 

Targeting DNA molecules are not limited to molecules 
5 of the Yip class. The targeting DNA can be a fragment of 
DNA purified from a larger plasmid, with such a plasmid 
constructed in such a manner that the desired targeting 
sequence is interrupted by, among other sequences, a 
bacterial or yeast replicon. The plasmid is also con- 
10 structed in such a manner that upon cleavage with a 

restriction enzyme that will release the replicon from 
the inner section of the targeting sequence, a yeast 
selectable marker remains covalently linked to the outer 
two ends of the targeting sequence. 

Alternatively, a selectable marker and a targeting 
sequence can be ligated together in vitro, and ligation- 
products consisting of one copy of the targeting sequence 
and one copy of the selectable marker (or multimers 
consisting of alternating targeting and selectable marker 
20 sequences in a uniform orientation) are purified. These 
ligation products are circularized in vitro and cleaved 
with a restriction enzyme to introduce a doub le - s trand 
break of gap in the targeting sequence and leaving the 
selectable marker intact. 

Finally, the two halves of a targeting sequence can 
be ligated to a selectable marker in a single three-way 
ligation in vitro to generate a targeting molecule 
suitable for transformation. 
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Yeast Ar m Ve c tor s 

Yeast arm vectors or YAC arm vectors, which are used 
to produce yeast artificial chromosomes, are also the 
subject of the present invention, YAC arm vectors 
include a yeast selectable marker gene, a bacterial 
origin of replication, a bacterial selectable marker gene 
and a yeast telomere. They may additionally include a 
yeast replication origin (ARS) and/or a yeast centromere 
sequences. The components of these YAC arm vectors can 
be obtained from sources in which they occur naturally or 
can be produced using recombinant or genetic engineering 
techniques or chemical synthesis. For example, the 
telomere sequences, centromere sequences and ARS can be 
obtained from yeast or from another organism. It is only 
necessary that they function in yeast host cells as, 
respectively, a telomere, a centromere or an ARS. 
Components which have equivalent functions, regardless of 
their source (e.g., yeast or other source) are referred 
to herein as functional equivalents of the corresponding 
yeast element. 

The present invention is illustrated by the 
following Examples » which are not intended to be limiting 
in any way. 

Methods Used Herein 

Unless otherwise noted, methods for plasmid purifi- 
cation, restriction enzyme digestion of plasmid DNA and 
gel electrophoresis, use of DNA modifying enzymes, 
ligation, transformation of bacteria, transformation of 
yeast by the lithium acetate method, preparation and 
Southern blot analysis of yeast DNA, tetrad analysis of 
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yeast, preparation of liquid and solid media for growth 
°^ ££ii and yeast, and all standard molecular bio- 
logical and microbiological techniques can be carried out 
essentially as described in Ausubel et al, (Ausubel, F.M. 
-- ~-' ~---2~^£2£2££i£_iB_M£i££Hiar_Biolo£y , Greene 
Publishing Associates and Wiley- Interscience , New York, 
1987). 

IXAMPLE_I SELECT10N_BY_H0MOL0GOyS^RECOMBI 

A_TARGETED_DNA_CL0NE_FR0M_A^PNA_Y 
LIBRARY 

Plasmid pYAC4 (ATCC #67380) wa's used", to construct a 
library of human genomic DNA. Human DNA was isolated 
from white blood cells (D. Burke,. Ph.D. Thesis, 
15 Washington University. St. Louis, MO,, 1988), partially 
digested with EcoRl and ligated to pYAC4 arms digested 
with EcoRI and BamHI . 

The ligation mixture was then used to transform 
yeast host strains , either MGD131-10c or IV-16d, using 
20 the spheroplast method (Burgers, P.M.J, and Percival, 
K.J. (1987) Analxtical_Biochemistri 16 3:391-397). (The 
construction of host strains MGD131-10c and IV-16d with 
the appropriate marker deletions is described in Example 
III below.) Since the pYAC4 vector carries the yeast 
25 selectable markers TRPl and URA3 , transf oroants can be 
selected for by growth on plates lacking tryptophan and 
uracil. 11,625 YACs with an average size of 190 kb (0.73 
human genome equivalents) are individually grown in the 
wells of 96-well microtiter plates; 0.1 ml was taken from 
30 each well and pooled in three subpools of approximately 
4.000 clones each. For each subpool, an equal volume of 
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30% glycerol was added and the subpool was aliquoted and 
frozen at -70"C. 

For a library comprising 73% of one genome, and 
assuming equal representation of all clones, the proba- 
5 bility that it contains any one specific human DNA 

sequence is just over 0.5. The probability that one of 
six different fragments of DNA is represented in the 
library is l-(0.5)^, or 0.98. 

The construction of the targeting plasmid pl84DLARG 
10 is described below and illustrated in Figure 5. It 

carries the yeast ARG4 gene (Beacham, I.R. et al. (1984) 
gene 2^:271-279) as a selectable marker, and its 
bacterial origin of replication is derived from pACYC184 
(Chang, A.C.y. and Cohen, S.N. (1978) Journal^of 
Bacteriology 134:1141-1156.), which shares only limited 
sequence homology to the pBR322 origin used on pYAC4, 
The entire chromosomal copy (a 2.0 kb Hpal DNA fragment) 
of ARG4 has been deleted in the library host strains 
IV-16d and MGD131-10c. The 2.2 kb Bcll-Clal fragment 
20 from pACYC184 (Chang. A.C.Y. and Cohen, S.N., (1978). 
Journal of Bac t_eri qlogy 134: 1141-1156 . ) containing the 
pl5A origin of replication and the chloramphenicol 
resistance gene was ligated to BamHI-AccI digested pMLC28 
(a derivative of pSDC12 carrying the pUC18 multiple 
25 cloning site; Levinson et al. , J^Kol , Appl . Gen^ . 

2:507-517 (1984); plasmid pUClS (ATCC #37253) can sub- 
stitute for pMLC28 in the construction of pl84DLARG 
described here), BamHI and AccI cut this plasmid one 
time each, in the polylinker. The ligation mixture was 
30 digested with Sad and Hindlll , which cut in the PMLC28 
polylinker, and the digested DNA was treated with T4 DNA 
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polymerase to generate blunt ends. The DNA was ligated 
under dilute conditions to promote circularization, and 
the ligation mix was treated with the restriction enzyme 
Avail (to linearize any parental molecules) prior to 
transformation into bacteria. One plasmid, pl84DL, 
carrying only the sequences contained within the larger 
of the two Bcll-Clal fragments of pACYC184 and a permuted 
version of a portion of the pMLC28 polylinker was identi- 
fied. Plasmid pHpaS (provided by N. Schultes and J. 
Szostak; Department of Molecular Biology, Massachusetts 
General Hospital, Boston, MA 02114) carries the ARG4 
gene as a 2.0 Kb Hpal fragment inserted into the Hindi 
site of PMCL12 (a derivative of pSDC12 carrying the pUC12 
multiple cloning site). Levinson et al., Ji_Mol^_A£Elu. 
Gen^, 2:507-517 (1984). This plasmid was cut at the Fstl 
and Smal sites flanking the ARG4 insert, and the ARG4 
fragment was ligated to Pstl-Smal cut pl84DL. A plasmid 
carrying a single copy of the ARG4 gene inserted in the 
orientation shown in Figure 5 was isolated and designated 
20 P184DLARG. Figure 5 is a map of plasmid pl84DLARG. 

Genomic fragments for tyrosine hydroxylase (chromo- 
some 11), metallothionein II pseudogene (chromosome 4). 
anonymous DNA markers D16S3 and D16S37 (chromosome 16). 
and a 1.9 kb Hindlll fragment located 5' of the epsilon 
globin gene (chromosome 11) were subcloned into pl84DLARG 
and used for selection of clones by recombination from a 
YAC library. With the exception of the tyrosine 
hydroxylase gene fragment, all of the fragments were 
blunt ended by treatment with T4 DNA polymerase and 
30 ligated to Smal cut pl84DLARG." The tyrosine hydroxylase 
gene fragment was cloned into the BamHI site of 
pl84DLARG. A 1.3 kb 
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Hpal-BamHI fragment from the 5' end of the beta globin 
gene (chromosome 11) was blunt-end ligated to the same 
2-2 kb Bcll-Clal fragment used to construct pl84DLARG» 
The beta- and epsilon- globin fragments are 1.3 and 1.9 kb 
fragments^ respectively, from the human beta-hemoglobin 
locus on chromosome 11. The beta-globin fragment (ATCC 
#39698) was subcloned from pHUS'beta (Treco , D., et al . , 
Mol. Cell. Biol^. 5:2029-203 8, 19 8 5), and includes 
sequences from positions 61,338 (Hpal site) through 
62,631 (BamHI site) in the Genbank HUMHBB sequence. Thi 
fragment includes the 5' end of the human beta-globin 
gene. The Avail site at Genbank map position 62,447 was 
used to introduce a double - strand break for targeting, 
leaving 1.1 and 0.18 kb of homology on either side of the 
break. The 5' epsilon- globin probe (ATCC ^^59157), is a 
Hindlll fragment and includes sequences centered 
approximately 15 kb 5 ' to the epsilon-globin gene (ATCC 
59157), from positions 3.266 through 5,172 in the Genbank 
HUMHBB sequence. The Apal sites at map positions 4,361 
and 4,624 were used to create a 0.26 kb double - s trand gap 
for targeting, leaving 1.1 and 0.5 kb of homology on 
either side of the gap. 

Properties of the remaining four genomic DNA frag- 
ments are as follows: tyrosine hydroxlase (chromosome 
11; 2.3 kb BamHI fragment; ATCC #59475; double- strand 
break made with Hindlll, 0 . 6 kb from end); metal- 
lothionein pseudogene (chromosome 4; 2.8 kb Hindlll -EcoRI 
fragment; ATCC #57117; double-strand break made with 
Ndel,. 0.4 kb from end); anonymous DNA marker D16S3 
(chromosome 16; 1.5 kb Hindlll fragment; ATCC #59447; 
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double-strand break made with Apal. 0.75 kb from end); 
D16S37 (chromosome 16; 2.3 kb Hindlll fragment; ATCC 
#59189; double-strand break made with Apal. 0.95 kb from 
end) . 

Each targeting plasmid was linearized with a re- 
striction enzyme that cuts within the human DNA (the 
targeting DNA) and 20 /xg of digested DNA was used to 
transform the pooled library. Equal volumes of the three 
library subpools were thawed, mixed and inoculated into 
CM -ura - trp medium containing 40 ,ig/ml each of 
kanamycin and ampicillin. This culture was grown over- 
night at 30'C with vigorous shaking and harvested at a 
density of 1.86 x lo' cells/ml. The cells were trans- 
formed using the lithium acetate method (Ausubel. F.M. et 
al., Current_Protocols_in_Molecu^^^ Supplement 
5, Greene Publishing Associates and Wiley- Interscience 
New York. 1987). 20 pg of plasmid cut within the human 
DNA was used to transform 7 x 10^ cells in a volume of 
0.2 ml. and the entire transformation mix was spread onto 
the surface of eight selective plates (complete minimal 
niedia lacking uracil, tryptophan, and arginine) and 
incubated at 30'C for 3-7 days. 

Transformants were analyzed by restriction enzyme 
digestion and Southern hybridization analysis. DNA was 
prepared from each of the candidates and digested with 
the same enzyme used to linearize the targeting plasmid 
The Southern blot was probed with ^^F radiolabeled ARG4 
DNA. Homologous integration events are identified by 
hybridization to a single band of exactly the same size 
as the linearized transforming DNA molecule [the -Unit 
Length Linear" band (ULL) ; Figure 2]. A ULL can only be 
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generated if integration occurs into a DNA sequence that 
contains the restriction enzyme site in question, and 
contains enough homology surrounding that site to allow 
the re-synthesis (by repair) of the restriction enzyme 
site on the targeting plasmid. Candidates that display a 
TILL are assumed to be homologous integration events and 
are subjected to further analysis. Unit length linears 
were seen for 6 of 21 epsilon-globin candidates analyzed 
and for 3 of 14 beta-globin candidates. No unit-length 
linears were observed in candidate clones isolated with 
any of the other targeting fragments used. 

Figure 7 is a restriction enzyme and Southern blot 
analysis of clones selected by targeting with human 
epsilon- and beta-globin sequences. In the left panel. 
DNA from nine clones selected as arg+ were digested with 
Avail (the enzyme used to make the double - strand break in 
the beta-globin targeting sequence). In the right panel, 
DNA from nine clones selected as arg+ were digested with 
Apal (the enzyme used to make the double- strand break in 
the epsilon-globin targeting sequence). The asterisks 
identify clones correctly selected by homologous recombi- 
nation. The lanes marked K were loaded with purified 
beta-globin targeting plasmid digested with Avail (left 
panel), or purified epsilon-globin targeting plasmid 
digested with Apal (right panel). The size of this 
marker fragment is identical to the size predicted for 
correctly targeted events. The arrowheads indicate the 
fragment size predicted for correctly targeted events, 
5-6 kb in the left panel and 6,2 kb in the right panel. 
Hybridization was with ^^P labeled ARG4 DNA. 
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Three each of the beta- and eps ilon- globin positives 
were further analyzed by CHEF gel electrophoresis (Chu, 
G., Vollrath, D., and David, R.W. Science . 234: 1582-1585 
(1986)), and restriction enzyme and Southern hybridi- 
zation analysis, probing with epsilon- or beta-globin DNA 
as appropriate. This analysis demonstrated that all six 
YACs are identical and carry both beta- and epsilon- 
globin DNA, as would be expected since these two genes 
lie only 40 kb apart on human chromosome 11. In all six 
YACs the ARG4 DNA has integrated onto a YAC of 190 kb and 
the pl84DLARG "constructs have integrated as predicted 
into the homologous DNA within the globin locus. 

Homologous recombination has been successfully used 
to isolate unique genes from a DNA YAC library. The YACs 
isolated encompass the entire beta-globin locus from at 
least 16 kb 5' of the epsilon gene down to the beta 
globin gene, along with about 130 kb of flanking DNA. In 
addition, a similar selection protocol performed with the 
same DNA YAC library resulted in the isolation of YACs 
from the ^-globin locus after the library had been stored 
at -70'C for over fourteen months. It is thus disclosed 
here, for the first time, that it is possible to isolate 
clones from a human DNA YAC library by homologous - 
recombination selection. 

EXAMZLE_I1 SELECTION_BYJ10MOL0G0US^RECOMBI 

5I_A_DNA_CL0NE_FR0M_A_DNA_YAC_L1^ 
USII?G_ONE.STEP_GENE_DISRUPTION 
The method of one-step gene disruption (Rothstein. 
R.J., Me tho d s_in_Enz2no 1 og:^ , 101:202-211, Academic Press, 
New York, 1983) can be adapted for use in the selection 
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of clones from DNA libraries by homologous recombination. 
In this embodiment, a selectable marker is inserted into 
the targeting sequence. The targeting sequence, with the 
embedded selectable marker, is subsequently isolated as a 
5 single linear fragment (as diagrammed in Figure 3) and 

transformed into the pooled DNA YAC library, as described 
in Example I. Correctly targeted clones arising as a 
result of homologous recombination between the targeting 
molecule and specific DNA clones within the library will 
10 carry a single copy of the targeting sequence that is 
disrupted by the presence of the selectable marker, and 
will migrate at a specific and predictable position after 
restriction enzyme digestion and Southern blot analysis, 
using either ARG4 or the targeting sequence as a radio- 
15 labeled probe. This is in contrast to the process 

described in Example I, in which the correctly targeted 
DNA clones have two uninterrupted copies of the targeting 
sequence flanking the selectable marker. 

Figure 3 illustrates the selection by homologous 
20 recombination of a DNA clone from a DNA YAC library using 
one-step gene disruption. The thin line represents an 
insert of DNA in the form of a yeast artificial chromo- 
some (YAC). The solid box is the DNA fragment, a 
sequence of DNA constituting a portion of a DNA YAC clone 
found in the library that is homologous to the targeting 
sequence. In the diagram, the targeting sequence (solid 
boxes) has been modified by the insertion of the yeast 
ARG4 gene (open box) . The remaining portions of the DNA 
YAC are comprised of the YAC vector arms: the thick 
lines represent plasmid sequences for replication and 
selection in bacteria. The shaded boxes represent 
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genetic markers used for selection in yeast (yeast 
selectable markers URA3 and TRPl). The solid arrowheads 
and circle represent telomeres (TEL) and a centromere/ 
yeast replication origin (CEN/ARS) , respectively. Figure 
3a depicts the targeting molecule aligning with the 
target sequence on the DNA YAC . Figure 3b depicts the 
product of homologous recombination between the targeting 
and target sequences, with the targeting sequence having 
replaced the target sequence. 

As a specific example of thi.s embodiment of the •• 
basic concept, the 1.9 kb (Hindlll) 5' epsilon- globin 
fragment (see Example I) is subcloned into the Hindlll 
site of PUC18 (ATCC #37253). The resulting plasmid is 
digested with Apal, dropping out a 0.26 kb Apal fragment 
from the central portion of the 5' epsilon- globin insert. 
The 3' Apal overhangs are made blunt with T4 DNA poly- 
merase, and the resulting material is ligated to the 
purified ARG4 2.0 kb Hpal fragment (Beacham. I.R.. Gene. 
29:271-179. 1984). The resulting plasmid. with ARG4 
disrupting the 5 ' epsilon- globin sequence, is digested 
with Hindlll and transformed into the DNA YAC library, as 
described in Example 1. The specific example presented 
results in the replacement of 0.26 kb of the 5' epsilon- 
globin DNA with the ARG4 sequence, since Apal is not 
unique in the targeting sequence. For enzymes that are 
unique in the targeting sequence, however, the result 
will be a simple insertion. 
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EXAMPLE III HOMOLOGOUS^ECOMMNATION^ 

g4LKING_UTILI_ZING TWO YAC LIBRARIES 
^ ' Construction of Yeast Artificial Chromosome_(YACj^ 
Libraries 

S A.l) Saccharomyces Cerevisiae 

li££t^tI£l2«Cons traction 
The construction of a strain of S. cerevisiae 
carrying chromosomal deletions of each of the four 
genes used as selectable markers on the four YAC 
10 ' vectors described can be carried out as follows: 

A . 1 . a ) Deletion of ARG4^ 

The internal 2 . 0 kb Hpal fragment carrying the 
entire structural gene and regulatory elements for 
the yeast argininosuccinate lyase gene (ARG4) is 
deleted from a plasmid consisting of the 11 kb BamHI 
fragment isolated from p(SP013)2 (Wang, H-T., et 
al . , Molecular £ nd_ Ce 11 u 1 a r _ B i o 1 o^y 7:142S-1435, 
1987) inserted into the BamHI site of pUC19 (ATCC 
5^37254), by digestion with Hpal and relegation of 
the DNA under dilute conditions (1 pg/ml) . The 
resulting plasmid is digested with BamHI and intro- 
duced into an S. cerevisiae strain carrying the 
wild- type alleles for ARG4 , TRPl, URA3 , and LEU2 , 
and carrying any non-reverting his3" allele. The 
transformation is carried out in conjunction with 
any plasmid carrying yeast GEN and ARS elements, and 
the yeast HIS3 gene, using standard co-trans- 
formation conditions (Ausubel, F.M. et al. , Current 
Protoc ols in Molecular Biology, Chapter 13, Greene 
Publishing Associates and Wiley- Interscience , New 
York, 1989). A useful plasmid for this purpose can 
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readily be constructed by subcloning the 1 . 7 kb 
BamHI fragment from pRB15 (ATCC #37062) into the 
BamHI site of YCp50 (ATCC #37419). His"^ cells are 
screened for arginine auxotrophy by replica plating 
onto CM -arginine- plates . His"*" arg* cells are grown 
in the absence of selection for HIS3. and single 
colonies are isolated and screened for histidine 
auxotrophs. DNA from his* arg' colonies is prepared 
and analyzed by restriction enzyme and Southern blot 
analysis to identify tr ansf ormants- carrying the ARG4 
deletion (arg4A). This protocol is used to generate 
strain MGD131-10c used in Example I above. 
A . 1 . b ) °ll£ii2!}_£l_TR P lj_ 

In a yeast strain of opposite mating type as 
that used above, also , carrying mutant alleles for 
LEU2 and URA3 (leu2", ura3'). an identical procedure 
is carried out. but using a linear fragment of DNA 
carrying a deletion of the yeast gene for N-(5'- 
phosphoribosyl) - anthranilate isomerase (TRPl) . 
This is accomplished by subcloning the BamHI-XhoI 
fragment from pBR322 - Sc4120 (Stinchcomb. D.T., et 
a 1 . , J£HI l_o f _Mo 1 e c u 1 a r_B i o 1 o£2 . 158 : 1 5 7 - 1 7 9 , 
1982) into BamHl-XhoI cut pGEM7 , (Promega, Madison, 
Wisconsin) followed by deletion of the 1.2 kb EcoRI 
fragment containing TRPl and ARSl. The resulting 
plasmid, pK2, is digested with BamHI and Xhol and 
co-transformed with a HIS3-CEN-ARS plasmid, like 
that described in A.l.a) above, selecting for 
histidine prototrophs. and following the strategy 
outlined in A.l.a.) above to identify cells carrying 
the TRPl deletion (trplA). These cells are mated 



wo 93/03183 PCr/US91/08679 



-52- 



with cells carrying arg4A, and diploids heterozygous 
for the two deletions are isolated. This strain, 
TD7-16d, is sporulated, subjected to tetrad 
analysis, and spores with appropriate phenotypes are 

5 analyzed by restriction enzyme and Southern blot 

analysis to identify a strain with both the arg4A 
and trplA alleles (IV-16d used in Example I above). 
The genotype of TD7-16d is: a/a, arg4A/ARG4, 
LEU2/leu2-3,112, ura3 - 52/URA3 , trpl - 289/trplA , 

10 ade2.101/ade2-101, cyh^/cyh^, (CyH2/cyh2). 

his3Al/his3Al 

A . 1 . c ) Deletion of__L£U2 and URA32 

Strain TD7-16d is used as the recipient in 
additional co - transformation experiments, first with 
15 a linear DNA fragment carrying an internal deletion 

of the 1.3 kb HincII-AccI fragment corresponding to 
the yeast ^ - isopropylmala te dehydrogenase gene 
(LEU2), and subsequently with a linear fragment 
carrying an internal deletion of the 0.85 kb 
20 Pstl-Nsil fragment corresponding to the yeast 

orotidine-5 ' -phosphate decarboxylase gene (URA3). 
The plasmids YEpl3 (ATCC #37115; Broach, J.R., et 
^1' ' Gene, 8:121, 1979) and YIp30 (ATCC #37109; 
Botstein, D., et al. Gene, 8:17- 24, 1979) are used 
25 as sources for constructing deletion derivatives of 

the LEU2 and URA3 genes, respectively. A diploid 
that is heterozygous for all four deletions is 
sporulated, subject to tetrad analysis, and screened 
for haploid colonies that have the minimal genotype 
30 MATa arg4A trplA leu2A ura3A. This is the recipient 

strain for constructing Libraries 1 and 2. (See 
Figure 4.) 
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A . 2 ) Con s t rue t i o n_o f _Ye a s t_Ar t i f i c i a 1 
^lH°gp.g.og6_ (YAC) Vecto rs : 
The construction of an artificial chromosome 
requires that sequences capable of stabilizing the 
ends of linear DNA molecules (telomeres or TEL 
elements) be ligated to each end of the DNA chosen 
for cloning. In addition, each end needs to carry: 
1) a yeast gene that can be used for genetic 
selection in the initial construction of the library 
and for subsequent use as a selectable marker for 
use in selecting clones out of a library by homo- 
logous recombinat ion. and 2) sequences that allow 
replication in coli and confer antibiotic resis- 
tance in E. coli (selectable markers). Each end 
should also carry a sequence that functions as an 
initiation site for DNA replication (an ARS 
element). Finally, one and only one. of the two 
ends must carry a sequence that functions as a 
centromere in yeast (a CEN element) . 

To ensure that each linear DNA molecule trans- 
formed into yeast has two different ends (only one 
of which caries a CEN element), to facilitate the 
identification and recovery of each end uniquely, 
and to generate the two YAC libraries (Library 1 and 
Library 2), a total of four different ends are 
needed, utilizing four different yeast genes and 
four different antibiotic resistance markers. All 
of the various elements described above are ligated 
together in specific arrangements to generate yeast 
artificial chromosome vectors which can be propa- 
gated and manipulated in E. coli. To minimize the 
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possibility of homologous recombination between the 
ends of artificial chromosomes in Library 2 and 
targeting plasmids isolated from Library 1, the 
bacterial origins of replication on the vectors used 

5 in the construction of each individual library are 

from different sources. So that the final vectors 
are compact, easy to manipulate, and unlikely to 
rearrange by virtue of the duplicated bacterial 
origins of replication, each of the four ends is 

10 maintained as a different plasmid in bacteria, in 

contrast to the invention described in U.S. Patent 
No, 4,889,806. 

A . 2 . a ) t r u c t i o n_ o f _ a_CEN j_ AR S_E 1 ^ 

The PstI site of pUC19 (ATCC #37254) is removed 
by blunting with T4 DNA polymerase and recirculari- 
zation with T4 DNA ligase. The resulting plasmid 
(pCU19/Pst* is cut with EcoRI and Smal and the 3.1 
kb EcoRI-Smal fragment from A75p9 (carries ARSl, 
TRPl. and CEN3 ; Murray, A.W. and Szostak, J.W., 
Nature, 305:189-193, 1983) is inserted. The 
resulting plasmid (pTlOH) is cut with StuI and 
BamHI, removing the TRPl gene and all CEN3 
sequences. The StuI-BamHI fragment carrying the 
pUC19/Pst" backbone and ARSl is gel purified and 
ligated to a 382 bp Sau3A-ScaI fragment carrying 
CEN3 isolated from A75p9 (Murray, A.W. and Szostak, 
J.W., Nature, 305:189.193, 1983). The resulting 
plasmid (pT12H) carries ARSl sequences from 
positions 829-1453 in the published TRPl sequence 
(Tschumper G. and J. Carbon, Gene, 10:157-166, 1980) 
fused to CEN3 sequences 1-382 (Bloom, K.S. and J. 
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Carbon, Cell. 29:305-317, 1982). with both fragments 
inserted between the EcoRI and BamHI sites of the 
pUC19/Pst' polylinker. 

A . 2 . b ) Cons true tion_of_a_YAC_ARM_VECTOR 

2TKENPA2 

The Sau96 site of pMLC28 (pSDC12 with pUC19 
polylinker; Levinson. A., et al. J^_Mo l^_A2£l^_G en_^ . 
2:507-517, 1984) is removed by blunting with T4 DNA 
polymerase and recircularization with T4 DNA ligase . 
The resulting plasmid (pMLC2 8/Sau ' ) is digested with 
EcoRI and BamHI, and annealed with oligonucleotides 
1 and 2 (Figure 8a), and treated sequentially with 
T4 DNA ligase, T4 DNA polymerase, and T4 DNA ligase. 
The treated molecules are transformed into E. coli. 
and chloramphenicol resistant transf ormants are 
screened for the presence of an Apal site expected 
to be found in recombinant plasmids carrying the 
oligonucleotides. Plasmids which also regenerate 
the EcoRI and BamHI sites are subjected to dideoxy 
DNA sequence analysis. One plasmid with the correct 
sequence (pMLC28/SL) is digested with EcoRI, blunted 
with T4 DNA polymerase, and ligated to the 2.0 kb 
Hpal fragment carrying the yeast ARG4 gene. 
(Beacham, I.R., et al^. Gene. 29:271-279, 1984). 
The resulting plasmid with a single insert of the 
Hpal fragment (pT20) is cut with BamHI and Hindlll, 
and mixed with a purified 0 . 7 kb BamHl-EcoRI TEL 
fragment and the 1.0 kb EcoRI-Hindlll fragment con- 
taining ARSl and CEN3 from pT12H ( SectionA . 2 . a . ) . 
Transformants resulting from this three way ligation 
are screened by restriction enzyme analysis. The 
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correct plasmid (pT21) is digested with Smal and 
BamHI, and ligated to a 1 . 8 kb Smal-BamHI fragment 
derived from coli. The resulting plasmid is 
named pTKENDA. Figure 6a illustrates the plasmid 
5 map of pTKENDA, with relevant features and 

restriction sites. N: Nsil; A: Apal; Sm: Smal; 
B: BamHI; Hd: Hindlll; X: Xhol; R: EcoRI; Xb : 
Xbal; S: Sail (Hindll) ; Sp : Sphl; ARG4 : yeast 
ARG4 gene; Cm: chloramphenicol resistance gene; 
10 ORI(pMLC28): pMLC28 origin of replication; CEN3 , 

ARSl: yeast CEN3 (centromere) and ARSl (replication 
origin), respectively; TEL: se.quence that seeds 
telomere formation in yeast; exR: former EcoRI 
sites; dashed line: stuff er DNA fragment derived 
15 from E_^ £oli. The arrow indicates the direction of 

ARG4 transcription. 

The CEN3-ARS1 element used in pTKENDA is not 
the preferred sequence to use for constructing DNA 
YAC libraries. To convert pTKENDA to the more 
10 preferred derivative, pTKENDA is digested with Xbal 

and treated with the Klenow fragment of E. coli DNA 
polymerase to create a blunt end. This DNA is then 
cut with BamHI, dropping out the CEN3.ARS1 element 
originally derived from pT12H (section A. 2. a) and 
5 the TEL sequence. The 6.5 kb fragment (referred to 

as fragment A in this modification) carrying ARG4, 
the E^^ coli DNA stuffer fragment and the chloram- 
phenicol resistance gene is gel purified. 
Separately, pTKENDA is digested with Hindlll and 
0 BamHI and the 0.7 kb TEL fragment (referred to as 

fragment B in this modification) is gel purified. 
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Plasmid YCpl9 (ATCC #37364) is digested with 
Hindlll, PvuII. and Xbal and the 2.6 kb Hindlll- 
PvuII fragment carrying CEN4 and ARSl is gel 
purified (referred to as fragment C in this modifi- 
cation). Fragments A, B. and C are ligated 
together, transformed into E. coli, and chloram- 
phenicol resistant colonies are screened for 
plasmids with a single copy of fragments A. B. and 
C. The resulting plasmid is pTKENDA2 . 

A . 2 . c ) C ons true tion_of_a_Jf AC_ARM_y ECTOR 
ETKENDB 

The 827 bp EcoRl-PstI fragment from YRp7 (ATCC 
#37060). carrying the yeast TRPl gene, is blunted 
with T4 DNA polymerase and ligated to Hindi cut 
15 pUC19 (ATCC #37254). One plasmid, pT32H . is 

isolated in which the direction of transcription of 
the TRPl gene is directed away from the EcoRI site 
of the PUC19 polylinker. This plasmid is cut with 
EcoRI and BamHI , annealed with Oligos 3 and 4 
(Figure 8a), and treated sequentially with T4 DNA 
ligase, T4 DNA polymerase, and T4 DNA ligase . The 
treated molecules are transformed into E. coli, and 
ampicillin resistant trans formants are screened for 
the presence of an Apal site expected to be found in 
25 recombinant plasmids carrying the oligonucleotides. 

Plasmids which also regenerated the EcoRI site are 
subjected to dideoxy DNA sequence analysis. One 
plasmid with the correct sequence (pT32LH) is 
purified for further use. 

Plasmid PBS/+ (Stratagene Cloning Systems. 
LaJolla. CA) is cut with Aatll and EcoRI and blunted 
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with T4 DNA polymerase to delete the LacZ gene. The 
resulting molecules are circularized with T4 DNA 
ligase and ampicillin-resistant E. coli trans- 
formants are analyzed for the correct deletion 
5 derivative which regenerates the EcoRI site. One 

plasmid (pBSA) is cut with EcoRI and PstI (both of 
which cut within the pBS/+ polylinker) , and ligated 
to the Q.85 kb TRPl EcoRI-PstI fragment from pT32LH. 
Ampicillin-resistant transf ormants from this 
10 ligation are screened by restriction enzyme analysis 

for molecules with the correct structure (pT32BH) . 
pT32BH is then cut with BamHI-XhoI TEL fragment from 
pTKENDA, and transf ormants are screened by 
restriction (Section A.2.b.) enzyme analysis for 
15 molecules with a single insert of the TEL fragment. 

This plasmid, pTSSH, is cut with SphI, blunted by 
treatment with T4 DNA polymerase and recir cularized 
with T4 DNA ligase. The resulting plasmid is pT34H. 
pT34H is digested with SnaBI and BamHI . and ligated 
20 to the 1.2 kb SnaBI-BamHI fragment from plasmid 

pBR:^a (ATCC #39698). The resulting plasmid is 
designated pTKENDB. Figure 6b is a plasmid map of 
pTKENDB, with relevant features and restriction 
enzyme recognition sites: N: Nsil; A: Apal; Sn: 
25 SnaBI; B: BamHI; Hd: Hindlll; X: Xhol; R: EcoRI; 

Xb: Xbal; He: Hindi; Sp: SphI; P: PstI; TRPl: 
yeast TRPl gene; Ap : ampicillin resistance gene; 
ORI(pBS/+): pBS/+ origin of replication; ARSc: 
consensus ARS sequence (TAAACATAAAA ; Braoch. J. et 
^ • t Co ld_S^ing Harbor Symp. Quant. Biol^ 47:1165 
(1983)). TEL: sequence that seeds telomere 
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formation in yeast; exR, former EcoRl site; exP; 
former PstI site; dashed line: staffer DNA fragment 
derived from human beta-globin DNA. The arrow 
indicates the direction of TRPl transcription. 
A . 2 . d ) C on s t r u c t i on_of _a_YA C_Ar m_Ve c t o r 
BTKENDC 

The 622 bp Sall-Hindlll fragment from pACyC184 
(ATCC #37033; Chang. A.C.Y. and Cohen. S.N. Journal 
£l_Ba£teriologir, 134:1141-1156, 1978) is subcloned 
into Sall-Hindlll cut pBS/- (Stratagene Cloning 
Systems, LaJolla. CA) to generate pT40H. The host 
strain XLl-Blue (Stratagene Cloning Systems. 
LaJolla, CA) is infected with wild-type M13 (Bio-Rad 
Laboratories, Rockville Centre. New York) and a 
mixture of wild-type and pT40H phage particles are 
isolated. Cells from the dut'ung" E. coli strain 
CJ236 (Bio-Rad Laboratories. Rockville Centre. New 
York) are infected with this mixture of phage, and a 
mixture of pT40H and M13 s ingle - stranded DNA is 
20 isolated. Oligo 13 (Figure 8c) is used essentially 

as described by Kunkel (Kunkel. T.A. Proceedings_o f 
£l!£_Na t i on a 1_A c ademy_o f _S c i enc e s_y S , 82:488-492. 
1985) to introduce a C to T substitution at the 
XhoII site corresponding to position 1870 of 
pACyci84, to generate pT40/X'H. The 622 bp Sall- 
Hindlll fragment from pT40/X'H is isolated and 
ligated to the 3.6 kb Sall-Hindlll fragment of 
PACYC184 purified by gel electrophoresis. The 
resulting plasmid (pT41H) is cut with XmnI and Styl. 
blunted by treatment with T4 DNA polymerase, ligated 
to EcoRI linkers (CGGAATTCCG) . and cut with EcoRl to 
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generate EcoRI overhanging ends. The 2237 bp EcoRI- 
linked Xmnl-Styl fragment is purified by gel 
electrophoresis , 

BamHI linkers are added on to the 1.1 kb 
5 Hindlll fragment from YIp30 (ATCC #37109) that 

carries the URA3 gene. This fragment is inserted 
into the BamHI site of pBS/+ (Stratagene Cloning 
Systems, LaJolla, CA) , such that the orientation of 
URA3 transcription is directed away from the EcoRI 
10 site in the polylinker. The resulting plasmid is 

cut with Hindlll, blunted with T4 DNA polymerase, 
and recircularized with T4 DNA ligase to remove the 
Hindlll site of the polylinker. The resulting 
plasmid is cut with Nsil and Sail, blunted with T4 
15 DNA polymerase, and recircularized with T4 DNA 

ligase to remove the Nsil, BamHI (3' side of URA3 
only), Xbal, and Sail sites in the plasmid. The 
resulting plasmid is cut with EcoRI and BamHI and 
annealed with Oligos 5 and 6 shown in Figure 8b. 
20 The mixture is treated with T4 DNA ligase, T4 DNA 

polymerase, and again with T4 DNA ligase, and 
transformed into bacteria. Ampicillin-resistant 
transf ormants are screened by restriction enzyme 
analysis for the presence of an Apal site introduced 
with the polylinker, and plasmids that regenerate 
and EcoRI site are subject to dldeoxy DNA sequencing 
to confirm the correct polylinker sequence. This 
plasmid is pURA3LH. 

The host strain XLl-Blue (Stratagene Cloning 
Systems, LaJolla, CA) is infected with wild- type H13 
(Bio-Rad Laboratories, Rockville Centre, New York) 
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and a mixture of wild-type and pURA3LH phage 
particles is isolated. Cells from the dut'ung' E. 
coli strain CJ236 (Bio-Rad Laboratories, Rockville 
Centre. N.Y.) are infected with this mixture of 
phage, and a mixture of pURA3LH and M13 single- 
stranded DNA is isolated. Oligonucleotide 12 
(Figure 8c) is used essentially as described by 
Kunkel (Kunkel, T.A. , Proceedings of the National 
Academy of Sciences (USA), 82:488-492, 1985) to 
introduce a base substitution at the Xholl site at 
position 906 in the published URA3 sequence (Rose M. 
Grisafi. et al.. Gene, 29:113-114). The resulting 
plasmid, pURA3LHX". is cut with EcoRI and BamHI , and 
ligated to the 0.7 kb EcoRI-BamHI TEL fragment from 
15 pTKENDA (Section A.2.b.). The resulting plasmid. 

pT42H, is cut to completion with EcoRI and partially 
with PstI, blunted with T4 DNA polymerase, ligated 
to EcoRI linkers (CGGAATTCCG) , and cut with EcoRI to 
generate EcoRI overhanging ends. The 1.7 kb EcoRI- 
linked fragment is purified by gel electrophoresis 
and ligated to the EcoRI-linked fragment from pT41H 
purified above. Tetracycline resistant trans- 
formants are analyzed by restriction enzyme analysis 
for molecules with a single copy of each fragment in 
25 either orientation. This plasmid is digested with 

BamHI and Smal and the same 1 . 8 kb stuff er fragment 
derived from E^ coli used in the construction of 
pTKENDA is inserted. The resulting plasmid is 
designated pTKENDC. Figure 6c is a plasmid map of 
pTKENDC, with relevant features and restriction 
enzyme recognition sites. N: Nsil; A: Apal; Sm: 
Smal; B: BamHI; Hd: Hindlll; X: XhoII; R: EcoRI; 
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Ah; Ahalll; URA3: yeast URA3 gene; Tc : tetra- 
cycline resistance gene; ORI (pACYC184) : PACYC184 
origin of replication; ARSc: consensus ARS sequence 
(TAAACATAAAA; Broach, J. et__al . . (1983) Cold_S£rin^ 
H a rb o r _ Symp . Quan t^_B i o 1^ , ^7:1165) . TEL: sequence 
that seeds telomere formation in yeast; exS , exM, 
exN. exP, exB , exX: former Styl, XmnI , Nsil, PstI, 
BamHI, and XhoII sites, respectively; dashed line: 
stuffer DNA fragment derived from coli . The 
arrow indicates the direction of URA3 transcription. 
A . 2 . e ) Construction of a YAC_Arm_Vector 
pTKENPp 

PACYC177 (ATCC #37031; Chang. A.C.Y. and Cohen, 
S.N. Journal_of Bacteriology. 134:1141-1156, 1978) 
is cut with Sau96, blunted by treatment with T4 DNA 
polymerase, and the 1.2 kb fragment carrying the 
kanamycin resistance gene is isolated by gel 
electrophoresis. This fragment is ligated to Hindi 
cut pBS/+ (Stratagene Cloning Systems, LaJolla, CA) 
and chloramphenicol and kanamycin resistant clones 
are analyzed by gel electrophoresis for recombinants 
with the kanamycin gene inserted such that the 
direction of transcription is directed away from the 
EcoRI site in the pBS/+ polylinker. The blunt- 
ending of the Sau96 sites and subsequent ligation to 
Hindi cleaved pBS/+ results in Sail sites at the 
left and right junctions. This plasmid is pT50H. 
To remove one of the two inverted repeats flanking 
the kanamycin resistance gene (the 5' inverted 
repeat relative to the direction of transcription), 
pT50H is cleaved with Sail and Dralll and the 1.08 
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kb fragment containing the kanaraycin resistance gene 
is purified, blunt - ended by treatment with T4 DNA 
polymerase, and ligated to Hindi digested pBS/+. 
The resulting plasmid, with transcription of the 
kanamycin resistance gene directed away. from the 
EcoRI site in the pBS/+ polylinker, is pTSOASD. 
pTSOASD is introduced into the host strain XLl-Blue 
(Stratagene Cloning Systems, LaJoHa, CA) . and 
subsequently infected with wild-type M13 (Bio-Rad 
Laboratories, Rockville Centre , New York) ajid a 
mixture of wild-type and pTSOASD phage particles are 
isolated. Cells from the dut'ung' E. coli strain 
CJ236 (Bio-Rad Laboratories, Rockville Centre, N.Y.) 
are infected with this mixture of phage, and a 
mixture of pTSOASD and M13 s ingle - s tranded DNA is 
isolated. Oligonucleotides 14, 15 and 16 (Figure 
8c) are used essentially as described by Kunkel 
(Kunke 1 , T . A . , ) Pr oceedin£s_of _the_Nat ional_Acade 
o|-Sciences_USA, 82:488-92, 1985) to introduce base 
substitutions at two Nsil sites (positions 2203 and 
2469 of the published pACYC177 sequence) and at an 
XhoII site at position 2602 of pACYC177. The 
resulting plasmid, pTSOHX is cut with EcoRI and 
Sphl, blunted with T4 DNA polymerase, and cir- 
cularized with T4 DNA ligase, (regenerating the 
EcoRI site). The resulting DNA preparation is then 
cut with Xbal. This fragment is ligated to the 882 
base pair Accl-Xholl fragment of pACYC177 (which has 
been blunted with T4 DNA polymerase, ligated with 
Xbal linkers (GCTCTAGAGC) , and treated with Xbal to 
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generate Xbal overhangs) carrying the plasmid origin 
of replication, to generate plasmid pT51H (either 
orientation will suffice). 

Plasmid pT52H is constructed by cutting plasmid 
5 YIp33 (ATCC #37064) with Hpal and AccI to release a 

1.6 kb fragment containing the yeast LEU2 gene 
(Andreadis, A., et al. , Cell, 31:319-325, 1982). 
This fragment is blunted with T4 DNA polymerase and 
ligated to pUClS (ATCC #37253) cut with Hindi. The 
10 resulting plasmid is cut with BamHI and Xbal, and 

annealed with oligonucleotides 7 and 8 (Figure 8b) , 
The mixture is treated with T4 DNA ligase, T4 DNA 
polymerase, and again with T4 DNA ligase, and 
transformed into bacteria. Ampicillin resistant 
15 transformants are screened by restriction enzyme 

analysis for the presence of an Apal site introduced 
with the polylinker and plasmids that regenerate a 
BamHI site are subject to dideoxy DNA sequencing to 
confirm the correct polylinker sequence. The 
20 resulting plasmid is pT52LH. pT52LH is digested 

with BamHI and PstI, and the gel purified 1 . 6 kb 
fragment is ligated to pT51H cut with BamHI and 
Pstl. The resulting plasmid, pT53H, is digested 
with Seal and Bglll, and ligated to the double- 
stranded oligonucleotide shown in Figure 8c (oligo- 
nucleotides 9A and 9B) . The resulting plasmid 
(pT53HL) is partially digested with Hindlll, 
followed by complete digestion with Bglll and the 
digestion product corresponding in size to 
linearized pT53HL (approximately 3.7 kb) is 
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purified. This species represents cleavage at the 
adjacent Hindlll and Bglll sites introduced via 
Oligonucleotides 7 and 8 (Figure 8b). Plasmid 
pTKENDA (Section A.2.b and ATCC accession number 
40833) is digested with EcoRl and treated with the 
Klenow fragment of E. coli DNA polymerase to 
generate a blunt end. This DNA is then digested 
with BamHI and the 0.7 kb TEL fragment is gel 
purified. Plasmid YCpl9 (ATCC #37364) is digested 
with Hindlll, PvuII , and Pvul and the 2 . 6 kb 
HihdIli-PvuII fragment carrying CEN4 and ARSl is gel 
purified. The purified CEN4-ARS1 and" TEL fragments 
are ligated to Bglll-Hindlll digested pT53HL and 
transformed in E. coli. Kanamycin resistant trans- 
formants are screened for plasmids with a single 
copy each of the CEN4-ARS1. TEL. and T53HL frag- - 
ments. The resulting plasmid is pT54H. pT54H is 
digested with PvuII and Sad, and ligated to the 1.5 
kb Sad- PvuII fragment lying between positions 
25,881-27,414 on the bacteriophage Lambda (New 
England Biolabs, Beverly, MA) map. The resulting 
plasmid is pTKENDD. Figure 6d is a plasmid map of 
pTKENDD with relevant features and restriction 
enzyme recognition sites. N: Nsil; A: Apal; B: 
BamHI; He: Hinclll; Pv: PvuII; P: Pstl. S. 
Sall(HincII); Hd: Hindlll; X: XhoII; Xb : Xbal; 
Sa: Sad; Ah: Ahalll; LEU2: yeast LEU2 gene; Km: 
Kanamycin resistance gene: ORI (pACYC177) : pACYC177 
origin of replication; ARSc: consensus ARS sequence 
(TAAACATAAAA; Broach. J. et al.. (1983) Cold_S£rin5 
S5lkor_S2m£^_2uant^_Biol^ 42: 1165) . CEK4/ARS1 : 
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CEN4/ARS1 fragment from YCpl9 (see text); TEL: 
sequence that seeds telomere formation in yeast; 
exR, exPv, exN, exX: former EcoRI, PvuII, Nsil and 
XhoII sites, respectively; dashed line: stuffer DNA 
5 fragment derived form bacteriophage Lambda. The 

arrow indicates the direction of LEU2 transcription. 

A . 3 ) Construction of Yeast_Artif icial 
Chromosome _ (YAC) Libraries 
DNA from human white blood cells is prepared 
and partially digested with restriction endo- 
nucleases, essentially as described (D.Burke, Ph.D. 
thesis, Washington Univ. , St, Louis, MO (1988)). 
DNA (with a desired average size of greater than 1.5 
megabases) is partially digested with Apal, Nsil, or 
^5 any enzyme that leaves a blunt end. To construct 

Library 1, plasmids pTKENDA2 and pTKENDB are used. 
pTKENDA2 is cleaved with BamHI and either Apal, 
Nsil, or Smal to release the stuffer fragment. 
pTKENDB is cleaved with BamHI and either Apal, Nsil, 
or SnaBI to release the stuffer fragment. For the 
construction of Library 2, plasmids pTKENDC and 
pTKENDD are used, pTKENDC is digested with BamHI 
and either Apal, Nsil, or Smal to release the 
stuffer fragment. pTKENDD is digested with Sad and 
25 either Apal, Nsil, or PvuII to release the stuffer 

fragment . 

Each vector is treated with calf intestine 
alkaline phosphatase under conditions recommended by 
the supplier and purified by phenol extraction and 
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ethanol precipitation. For each library, 50 ng of 
human DNA and 25 pg of each vector in each pair 
(pTKENDA2-pTKENDB or pTKENDC-pTKENDD) are mixed and 
ligated using T4 DNA ligase for 2 days at 12"C, in a 
5 ligation buffer recommended by the enzyme supplier. 

The ligated DNA is size fractionated by Field 
Inversion Gel Electrophoresis (Carle et al , , 
Science. 232; pp 65-68. 1986) in low-gelling temper- 
ature agarose (FMC Corp., Rockland, Maine), or CHEF 
10 gel electrophoresis (Chu et al.. 1986 op cit) and 

the portion of the gel containing DNA of 250-450 kb 
is excised and equilibrated with TE buffer + 45mM 
NaCl. 

A . 3 . b ) IlSSlI £I2£ ti2n_2i_Ye a s t_S£h e r £21 a s t s 
I£illl_5NA_Li£ated_to_YAC_Vector_Arms 
SDd_Selection_of_yeast_Cells_carryin£ 
Ar t i f ^c i a ]._Chr omo s ome s 
DNA prepared as described in section A. 3. a. can 
be used to transform a haploid S. cerevisiae strain 
carrying chromosomal deletions for ARG4 , TRPl , URA3 , 
and LEU2 to arginine and tryptophan prototrophy 
using human DNA ligated to pTKENDA2 and pTKENDB , es- 
sentially as described by Burgers and Percival 
(1987), with the following modifications: 10 - 20 
/il of the low-melt agarose carrying the DNA is 
melted at 68-C for 3 to 5 minutes. Carrier DNA 
(sheared salmon sperm or calf thymus DNA) is added 
to the cells to a final concentration of 30 - 40 
/ig/ml immediately before 200 ,.1 of cells is added to 
30 the melted gel slice. 
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For plating and selection of yeast cells 
carrying artificial chromosomes, transformed cells 
are mixed with top agar (IM sorbitol, 2% dextrose, 
0,5% ammonium sulfate, 0.17% yeast nitrogen base 
(Difco), 2.5% Bacto-agar (Difco) , 0.005% adenine 
sulfate, and supplemented with uracil and all of the 
amino acids listed in Table 13.1,1 of Ausubel et a 1 . 

(Ausubel et al. , £}^LI.m^^lL2l^l2,]Ll^kll-^9.lL^£:^l^L 
Biology, Greene Publishing Associates and Wiley- 
Interscience . New York, 1987) at the listed concen- 
trations, but omitting arginine and tryptophan for 
selection. The mixture of cells and. top agar is 
poured onto the surface of agar plates made 
identically to the top agar except that the final 
concentration of agar is 2% in the plates. Plates 
are incubated at 30*C for 5-7 days. 

To construct Library 2. human DNA ligated to 
pTKENDC and pTKENDD are used to transform the same 
1- cereyisiae strain to uracil and leucine proto- 
tropy. Top agar and plates are prepared as des- 
cribed above, but lacking only uracil and leucine. 
A . 3 . c ) Pooling of Clones 

Yeast colonies growing on plates selective for 
markers present on artificial chromosomes are 
transferred using sterile toothpicks into individual 
wells of 96-well microtiter plates filled with 200 
ftl of selective media. Plates are incubated with 
shaking at room temperature for 2 days and stored at 
4*C for up to one week. A fully representative YAC 
library of the human genome should be comprised of 
50.000 independent clones, assuming an average clone 
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size of 300 kb . This number of clones would fill 
521 microtiter plates and is stored as 10 separate 
subpools. When approximately 52 plates are filled, 
100 fil from each well is withdrawn, pooled, and 

5 thoroughly mixed with an equal volume (approximately 

500 ml) of 30% sterile glycerol. The cell density 
of the cells in glycerol should be about 2.5 x 10^ 
cell/ml, and can be adjusted to this density by 
counting cells prior to glycerol addition. The 

10 pooled cells are then aliquoted into microcentrifuge 

tubes in volumes of 0.1 to 1 ml , set on dry ice to 
quick freeze, and stored at .70*C. This is repeated 
for each of the 10 separate subpools. 

^ • ) Il^Illl 2Iina t i on^o f _P o o 1 e d_L ib r a^ 

^5 IillSl^_£B^_ S e 1 e c t i o n_ o f _ S £ e c i f i c _ A r t i f i c i a 1 

1 one s 

The isolation of DNA YACs by homologous 
recombination is illustrated in Steps 1 and 2 of 
Figure 4. 

B . 1 . ) ^£I!£iIH£t i on_o f _ th e^Tar^e t in£_P 1 

The desired fragments of human DNA (the 
targeting sequences), previously identified as being 
unique or at low copy number in the human genome are 
substituted for the TEL and stuffer domains of 
25 pTKENDC. 50 pg of the resulting subclones are 

prepared and digested to completion with a 
restriction endonuclease which generates a linear 
molecule harboring a double-strand break or gap in 
the targeting sequence, in such a manner that at 
least 150 base pairs, but possibly less, of 
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targeting DNA remains on either side of the break or 
gap, and the pTKENDC vector backbone is intact and 
contiguous with the targeting DNA. The digested DNA 
is purified by phenol extraction and ethanol pre- 
5 cipitation and resuspended in 20 pi. 

B . 2 . ) Transformation of YAC_Library 1 with 

the_TargetingPlasmidand_S elect i 
gf Clones Homogolous to the Targeting 
Sequence 

IQ 0.1 ml of each of the 10 subpools are combined 

in 100 ml CM -arg, trp selective media supplemented 
to 0.05X YPD. Cells are grown overnight with 
vigorous shaking at 30C to a density of 2 x 10^ 
cells/ml. Cells are prepared for transformation by 

15 the lithium acetate method (Ito et al. , 1983) 

essentially as described (Ausubel et al. , Current 

Protocols in Mole cu_lar__Bio logy ^ Greene Publishing 

Associates and Wiley - Interscience , New York, 1989), 

g 

and split into six 200 f^l aliquots at 2 x 10 
20 cells/ml- 50 fig of each of the linearized targeting 

plasmids (in 20 pi) is mixed with 10 pg (2 pi) 
sonicated calf thymus DNA and added to a 200 pi 
aliquot of cells. After transformation, cells are 
spread onto the surface of CM-arg, trp, and uracil 
25 agar plates and incubated at 30*C for 3-5 days. The 

omission of uracil from the media selects for cells 
that have stably integrated the targeting plasmid 
derived from pTKENDC. 
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C ) Ana l^£i s_o f _C 1 o n e s 

C . 1 ) ll-B^^fil£i£B_A2a 1 i s_o f _C lone s 

Yeast colonies prototrophic for arginine, 
tryptophan, and uracil are candidates for clones 
5 carrying the targeting plasmid integrated into a 

human DNA YAC with a region of identity to the 
targeting sequences on the targeting plasmid. 
Colonies in which the targeting plasmid integrated 
into a YAC are identified by a marker segregation 
10 assay. The loss patterns of the three, markers are 

analyzed in cells derived from the selected clone 
which have lost the YAC after growth on non- 
selective media. Cells are patched onto YPD plates 
and grown non-selectively for two days, replica 
15 plated onto a second YPD plate and grown for another 

two days. Cells from the second YPD plate are 
struck-out for single colonies on a third YPD plate. 
After three days, the plate with single colonies is 
replica printed onto a CM -arginine, tryptophan 
20 plate, and a CM -uracil plate. Clones in which the 

targeting plasmid is integrated into a YAC are 
identified by their characteristic pattern of 
co-loss of all three markers. In these cases all 
colonies that are auxotrophic for arginine and 
tryptophan (colonies that lost the markers identi- 
fying the YAC) are also auxotrophic for uracil. 
C.2) ^lstriction_En22me_and_Sout^ Analysis 

of Clones 

Total DNA is prepared from yeast colonies proto- 
30 trophic for arginine. tryptophan, and uracil. The DNA is 
digested with the same restriction enzyme used to 
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generate the doub le - strand break in the targeting 
sequence. 1 of the digested DNA is subject to agarose 
gel electrophoresis and Southern transfer and probed with 
32-P labeled DNA corresponding to the fragment of the 

5 URA3 gene carried in pTKENDC. As a control, 1 ng of the 
digested plasmid generated in B.l) above is run alongside 
the yeast DNA samples. A correctly targeted event is 
characterized by a band on the autoradiograph that 
migrates exactly the same distance as the pure, 

10 linearized targeting plasmid. 



C . 3 ) Rescue of Clone Termini_to_£ener ate_Labeled 

Z£2k^£^-g^^:thern Blot Analysis to Identify 
Sequences that ^^^_SiriglB_Co^2X^lIL^lhl-9,^22Jll 
Under Analysis^ ^gggj: B^H£Il_of_J£jl£ 
1 5 Q^l^n^gation of Cloned_Inserts_Rela^ive_to 

y££££i_Arms_^_^eneration_of_a_Targetin^ 
f rom__Clone Termini , and Transf ormation_Into 
Pooled Library 2 
The YAC cloning vectors pTKENDA2, pTKENDB , pTKENDC, 
20 pTKENDD have been designed specifically to facilitate 

the rescue of cloned DNA from the ends of DNA YACs by 
simple microbiological techniques. One or more 
recognition sites for restriction enzymes that cut 
mammalian DNA relatively frequently (approximately once 
25 every 0.5-1,5 kb) are positioned at the junction between 

the bacterial plasmid replicon and the yeast telomere 

(TEL) or yeast replication origin (ARS) and centromere 
(CEN) sequences. For any one of the four ends, 
recognition sites for a subset of such enzymes are not 
30 found at any other position in the plasmid replicon or 
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the yeast selectable marker on that end, such that 
cleavage of total yeast DNA isolated from cells carrying 
a particular DNA YAC with one of these enzymes rescues 
(as illustrated in step 3 of Figure 4) DNA from the 
5 cloned insert covalently linked to the yeast selectable 
marker and bacterial replicon, but free of yeast chromo- 
some replication and stability elements (telomeres, 
centromeres, and yeast replication origins). This 
"rescued" DNA is used as the targeting plasmid for the 
10 second DNA YAC library. Column 2 of the Table (RESCUE 
SITES) lists the restriction enzymes useful for rescuing 
cloned DNA adjacent to each of the four ends in the two 
DNA YAC libraries. Column 3 (ADDITIONAL ENZYMES) lists 
some of the additional enzymes that can be used in con- 
15 junction with the enzymes listed under RESCUE SITES in 
the event that a RESCUE SITE enzyme rescues a very long 
sequence containing a repetitive DNA element that might 
prevent the clone from being useful for selecting DNA 
YACs by homologous recombination. 
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YAC END 



TABLE 

RESCUE SITES ADDITIONAL ENZYMES 



pTKENDAZ 



Hindi (1433) 
Hindlll (1844) 
SphI (4522) 



10 



PstI 

Xhol 

EcoRI 

BamHI 

Kpnl 

StuI 

Avail 

Hpal 



(3169) 

(21462) 

(2669) 

(5604) 

(8902) 

(3872) 

(790) 

(4240) 



pTKENDB 



Hindi (1433) 
EcoRI (2669) 



15 



Xhol 

Tthllll 

Styl 

BamH 

Kpnl 

StuI 



(21462) 

(1070) 

(785) 

(5604) 

(8902) 

(3872) 



Hpal (4240) 

pTKENDC Ahalll (1192) Tthllll (1070) 

20 BstYI (930) Xhol (21462) 

EcoRI (2669) BamHI (5604) 

Kpnl (8902) 
Hpal (4240) 



pTKENDD 



25 



Ahalll 

BstYI 

BamHI 



(1192) 

(930) 

(5604) 



HgiAI (1348) 
Hpal (4240) 
SphI (4522) 



The numbers in parentheses represent the average number of 
base pairs between restriction sites calculated for 
mammalian DNA. 
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The recovery, analysis and use of clone termini for 
recombination walking is illustrated in Steps 3-6 of Fig. 
4, U: yeast URA3 gene; X: restriction enzyme cleavage 
site used to make targeting break; striped box: 
targeting sequence; thick lines: plasmid sequences for 
propagation and selection in E. coli; Ap : ampicillin 
resistance gene; Cm: chloramphenicol resistance gene; T: 
yeast TRPl gene; A: yeast ARG4 gene; solid circles and 
horizontal arrowheads: yeast centromere/replication 
origins and telomeres, respectively; thin lines: cloned 
human DNA in Library 1; Y: restriction enzyme cleavage 
sites used for end-rescue; L: yeast LEU2 gene; Km: 
Kanamycin resistance gene; Tc : tetracycline resistance 
gene; Z: restriction enzyme cleavage site used to make 
targeting break in end-rescued DNA; thick shaded line: 
cloned human DNA in Library 2. The thin line in Library 
2 DNA represents a sequence homologous to end-rescued DNA 
from Library 1 . 

The remainder of the discussion will relate to 
isolating (rescuing) the left-hand end of the YAC . but 
the principles can be extrapolated for homologous 
recombination walking using any of the four ends in the 
two DNA Libraries. The vertical arrows marked "Y" can 
represent the positions of Hindi sites lying at various 
positions throughout the human DNA (for mammalian 
genomes. Hindi sites have an expected distribution of 1 
site/1.4 kilobases). The vertical arrow on the extremV 
left side indicates the position of a Hindi site that " 
separates the TEL element from the TRPl-pBSA element. 
Cleavage of total DNA from the yeast strain carrying the 
YAC illustrated will release the TRPl-pBSA fragment from ~ 
the TEL sequence on the left side, but the right side 
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will remain attached to a fragment of cloned DNA 

extending to the first Hindi site within the insert. 

The total DNA is ligated under conditions which promote 

circularization of fragments. A fraction of this DNA is. 

used to transform bacterial cells to isolate ampicillin 

5^ resistant plasmids. 

Approximately 60 /ig of plasmid DNA is purified, and 

several micrograms are digested with Hindi and the 

enzyme used to digest the genomic DNA constituting the 

library (SnaBI, Apal, or Nsil) . If Library 1 was con- 

structed by cleaving genomic DNA with Smal and ligated to 

the SnaBI digested pTKENDB, then an enzyme other than 

Smal or SnaBI which flanks the cloning site must be used 

(for example, Apal or Nsil), The digest is fractionated 

on an agarose gel and the non-YAC vector fragment (the 

15 rescued insert) is purified and a fraction is labeled 
32 

with -phosphorus or chromogenic nucleoside triphos- 
phates. This DNA is used in three different ways: 
1. The DNA is cut with a selection of restriction 

enzymes that are known not to cut within the TRPl 
20 pBSA sequence. (ADDITIONAL ENZYMES in the Table 

among others can be used) . The digestion products 
are analyzed by gel electrophoresis to identify 
restriction enzymes which will cut the cloned DNA 
isolated from the end of the YAC, 
25 2. The labeled DNA is used to probe a Southern blot 
filter of human and yeast DNA to determine if the 
end of the YAC corresponds to a single copy sequence 
in the human genome, or if it is homologous to the 
yeast genome. Human sequences that are single copy 
30 or low copy and not homologous to yeast DNA are 

preferred for targeting. 
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3 . The labeled DNA is used to probe a dot-blot, in 

which total DNA from yeast cells carrying YACs have 
been isolated and fixed to a Nylon membrane. The 
membrane is spotted with DNA from the YAC that the 
labeled DNA is derived from (YAC-Z), the YAC over- 
lapping with YAC-2 which is used to isolate YAC-2 in 
the previous recombination selection step (YAC-Y) , 
and the YAC overlapping with YAC-Y which was used to 
Isolate YAC-Y in the previous recombination 
selection step (YAC-X) (i.e., the last three YACs 
isolated in the walk]. Hybridization only to the 
YAC from which is derived (YAC-2 in this case) 
indicates that the TRPl-pBSA end of YAC-Z extends in 
the correct direction, away from the YACs Y and X. 
This is confirmed by a similar analysis with the 
other end of YAC-Z^ which must hybridize with YAC-Z 
and YAC-Y and/or YAC-X. 

A targeting plasmid meeting the criteria outlined in 
2) and 3) above is cleaved with an appropriate restric- 
tion enzyme (identified from 1 above) and as denoted as Z 
in Figure 4), and used as the targeting plasmid to 
isolate clones from Library 2, as described in Section 
B.2 above. 

15AMPLE_IV MethodsforPrjBve g ting_ the _ Occur r e n c e 

£f^Re£e t it iy e_I n t e r s p e r s e d DNA_at_DN A 

The vectors described in Example III incorporate 
novel features that are specifically designed to 
facilitate chromosome walking. First, the two ends of 



wo 93/03183 



PCr/US91/08679 



.78- 



the artificial chromosome are derived from two different 
plasmids, each with its own sequence to seed telomere 
formation in yeast, a bacterial origin of replication, a 
gene far resistance ta an antibiotic for selection in E. 
coll , and a selectable gene for clone selection in yeast. 
This system allows either end of the YAC to be isolated 
as a bacterial plasmid for amplification and use in each 
walking step, as opposed to the possibility of isolating 
only one end with existing YAC vectors. 

In the preferred embodiment of any walking strategy, 
the extreme end of a clone is used as a probe to isolate 
overlapping clones in the walk. The usefulness of such a 
probe is limited by the presence of repetitive DNA which 
may be homologous to thousands of clones within the 
library. Members of the class of DNA sequences termed 
highly repetitive interspersed are found at thousands of 
discreet locations throughout the human genome. 
Specifically, a member of the Alu family of repetitive 
DNA sequences is found, on average, spaced at 1 to 3 
kilobase intervals throughout the genome (Moyzis, R.K., 
Genomics, 4:273-288, 1989). 
The methods and vectors described in Example III 
have been designed to minimize the occurrence of 
repetitive DNA at the terminus of the DNA clone inserts 
in a human DNA YAC vector library. The first feature 
incorporated into the vector library design is the use of 
a specific set of restrxclfiorr ehdonucleases to cleave* 
human DNA. Numerous DNA sequences from the Alu and LI 
family of repetitive DNA were analyzed using computer 
programs that identify recognition sites for restriction 
endonucleases . The results of this analysis revealed 
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that recognition sites for the restriction enzymes Apal , 
Hsil, and Seal are not found in the published consensus 
sequences for any of the Alu subfamilies, and are found 
only rarely in sequenced members of the LI family (of 
approximately 30,000 base pairs of sequences LI DNA 
analyzed, there were only five sites for the three 
enzymes listed above; 23 sites would be expected based on 
the dinucleotide frequencies found for human DNA). These 
two families alone account for approximately 10% of the 
mass of the human genome, indicating that as many as one 
in ten clone ends (1 in 5 clones) may terminate within 
one of these repetitive sequences. By using the enzymes 
disclosed above to cleave human DNA, one creates an 
inherent bias against the occurrence of these two 
repetitive sequences at the ends of clones. 

The second feature incorporated into the design of 
the YAC cloning vectors to minimize the occurrence of 
repetitive DNA in targeting probes used for walking is 
limiting the size of the DNA probe fragment rescued from 
the DNA clone end. Smaller DNA fragments have a lower 
probability of containing repetitive DNA. The vectors 
described in Example III have been designed to rescue 
fragments of human DNA on the order of 1-2 kb in length 
by a single restriction enzyme cleavage of the YAC clone. 
This is accomplished by the insertion of a polylinker 
carrying recognition sites for multiple restriction 
enzymes which cut. on the average, once every 0.5 - 1.5" 
kb. When total DNA from yeast carrying the YAC is cut 
with one of these enzymes, a fragment of DNA containing a 
plasmid origin of replication and a drug resistance 
marker (for propagation and selection in E. coli) , as 
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vell as a gene for selection In yeast, and approximately 
1-2 kb of human DNA will be released. This fragment can 
be circularized and transformed into bacteria. As 
expected, the recognition sites for enzymes that are most 

5 useful for this step are found within several of the 
elements used in the construction of the proposed YAC 
cloning vectors* In vitro mutagenesis to delete restric- 
tion enzyme cleavage sites, along with the judicious 
choice of combinations for the two plasmid replication 

10 origins, the four drug-resistance markers, and the four 
yeast selectable markers is used to create vectors 
lacking the frequent-cutting res trie tion.- enzyme cleavage 
sites listed in the Table (Rescue Sites). 

EXAMPLE_V Use of Terminal Fra gments J)erived_from 

15 Yeast A rtifi cial Chromosome Clones _for_ the Isolation 

of Clones Knovn_to be Present in a Yeas t_ Artificial 
Chromosome Library--A Model System to Test the 
Feasibility of Library Screening by Homologous 
Re combin ation 

20 used homologous recombination screening to 

extract a clone from the library that was known to exist 
within the library. Since the vector arm containing the 
TRPl gene in YACs constructed with pYAC4 contains a 
plasmid replicon and a selectable marker (the beta- 

25 lactamase gene conferring ampicillin resistance) , the 
technique of "plasmid rescue" was used to isolate 
terminal fragments from two YACs constructed in the 
vector pYAC4. The restriction enzyme Xhol cleaves at a 
single site within the TRPl vector arm, at the junction 

3Q between the telomere and pBR322 sequences. Complete 
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30 



digestion of YAC DNA with Xhol should produce a restric- 
tion fragment devoid of telomeric sequences, containing a 
functional plasnid replicon and Amp^ marker, and har- 
boring a segment of human DNA that was adjacent to the 
vector arm in the original YAC clone and extends to the 
5 terminal Xhol site in the human DNA insert. 

A group of 161 YACs within the library were con- 
structed using the host yeast strain MGD131-10c (genotype 
a leu2-3.112 ADE2 cyhl"" hisAl trpl-289 agr4A ura3-52). 
Total DNA from two clones in this group was digested with 
10 Xhol, ligated under dilute conditions to promote intra- 
molecular circularization, and transformed into E. coli 
(all steps carried out essentially as described in 
Ausubel et a 1 . . 1988 [above]. Plasmid DNA was isolated 
from ampicillin resistant colonies and subjected to 
15 restriction enzyme analysis. One human DNA fragment from 
each of the two rescued plasmids was subsequently blunt- 
ended by treatment with T4 polymerase and ligated into 
the Smal site of pl84DLARG. The fragments, lOB and 8A. 
are 1 and 4 kb fragments, respectively, of human DNA 
lying adjacent to the TRPl vector arms in two different 
YACs. The resulting constructs (plasmids pl84-10B and 
P184-8A) were digested with a number of restriction 
enzymes which do not cleave pl84DLARG to identify an 
enzyme that would cut within the human DNA to promote 
targeting. 20 pg of each construct was digested with tHe 
appropriate targeting enzyme- and used for library " 
screening, essentially as described in Example 1. 
Fragment 8A contains a single Kpnl site lying 2-8 Kb from 
one end and this enzyme was used to introduce a unique 
double strand break within the inserted sequence in 
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pl84-8A. Fragment lOB contains a single Avail site lying 
0.5 Kb from one end and this enzyme was used to introduce 
a unique double strand break within the inserted sequence 
in pl84-10B. 

Eleven arg"*" colonies resulting from screening with 
clone 8A were isolated and analyzed. Similar to strain 
IV-16d (Example 1 and ATCC Accession No. 74010) strain 
MGD131"10c carries a 2 kb deletion encompassing the 
entire ARG4 gene. However, the two strains differ with 
regard to their LEU2 genotype; IV-16d is leu"^ and MGD131- 
10c has a leu phenotype. Seven of the eleven colonies 
displayed a leu phenotype. suggesting that they indeed 
represented independent isolates of the original YAC from 
which clone 8A was derived (a very strong possibility 
since strain MGD131-10c is the host for only 161 out of 
the 11,625 YACs (1.4%) in the library). Seventeen arg"*" 
colonies resulting from screening with clone lOB were 
isolated and analyzed. Three of the 17 colonies dis- 
played the leu' phenotype. The presence of the leu' 
marker strongly suggests that these clones represent 
isolates of the original YAC from which clone lOB was 
derived. 

DNA was prepared from each of the seven leu' 
colonies isolated by screening with clone 8A as well as 
one of the leu"*" colonies. DNA was digested with the same 
enzyme used to linearize the transforming DNA molecule 
(Kpnl), A Southern blot of these digests were probed 

with 32-P labeled ARG4 DNA. As described in Example 1, 

homologous integration events should reveal hybridization 
to a single fragment of exactly the same size as the 
linearized transforming DNA molecule (referred to in 
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Example 1 as a Unit Length Linear Fragment, or ULL) . Of 
the eight clones analyzed, all seven in strain MGD131-10c 
(the leu colonies) represent homologous events, while 
the single leu"^ transforoant analyzed (lane 8) does not 
(Figure 9). Thus, seven out of eleven candidate clones 
5 isolated were correctly targeted events. A similar 

analysis was performed on each of the three leu' colonies 
isolated by screening with clone lOB. All three clones 
displayed a ULL upon Southern blot analysis, while 14 
leu transf ormants did not. 
10 To confirm that the three homologous events isolated 

by screening with clone lOB and the- seven homologous 
events isolated by screening with clone 8A represent the 
independent isolates of the same YACs . we have mapped the 
termini of the YACs in these ten clones. Figure 10 shows 
15 the result of this analysis. Three bands are evident in ' 
each lane, corresponding to the ULL. the left arm, and 
the right -arm of the YAC . The bands migrate at identical 
positions in all seven YACs isolated with 8A. and at 
different, but identical positions in all three YACs 
20 isolated with lOB. These data show that the distance to 
the nearest Kpnl site at each end of the seven 8A YACs is 
identical, while the three lOB YACs display similar 
behavior for the positions of their terminal Avail sites. 

IXAMPLE_VI l££«lBinfi_of_a_Human_Yeast_Artin^ 
"~ ~ ^ii£222some_Librar2_&2_Homologous " — 

Recombination_to_Isolate_a_Yeast 
Artif icial_Chromosome_Clone_Derived_f 
£i!l_Human_Adenosine_Deaminase_I.o 
Synthetic oligonucleotides o6 and o7-2 werl used in 
30 the polym rase chain reaction to amplify a 1.376 base 
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pair fragment of the human ADA gene corresponding to 

positions 34,243-35,618 (Genbank Entry HUMADAG) from 

total human genomic DNA isolated from peripheral blood 

leukocytes. The amplified fragment was digested with 

PstI and the 852 base pair subfragment corresponding to 

5 HUMADAG positions 34,349-35,201 was isolated and cloned 

into the PstI site of plasmid pl84DLARG (Example 1). One 

insert orientation was chosen (that with HUMADAG 34.349 

position adjacent to the 3' end of the yeast ARG4 gene in 

pl84DLARG. The resulting plasmid was purified and 20 

10 micrograms was linearized at the unique EcoNI site within 

the human ADA insert (corresponding to HUMADAG position 

34,657) prior to transformation into the pooled YAC 

library. Transformation of the pooled YAC library was 

performed exactly as described in Example 1, with the 

15 exception being that the YAC library consisted of an 

additional 3.585 clones, for a total of 15,210 clones 

representing approximately 1.2 genome equivalents. 
+ ^ 

tour arg transf ormants were isolated. Three of 
these are displayed in figure 11 and all three displayed 

20 a unit-length linear fragment upon restriction enzyme 
digestion with EcoNI and Southern blot analysis. 
Analysis of the fourth arg"*' transformant confirmed that 
it carries the same insert as YAC 184ADA.C and 184ADA.D. 
All four transformants harbor a similarly sized YAC of 

25 ca. 200 kb , as judged by CHEF gel electrophoresis. The 

intensity of the ULL band in DNA prepared from YAC 

184ADA.B and other data indicate that YAC 184ADA.B has 
undergone multiple tandem integrations of the targeting 
plasmid. 
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Comparison of a representative YAC , YAC 184ADA.C, 
with human genomic DNA by restriction enzyme and Southern 
hybridization analysis using multiple probes and 
restriction digests confirmed that this YAC indeed 
contains sequences from the human ADA locus. ._ 

OLIGONUCLEOTIDE 06 

10 20 
5' AGATCTGTTT GAGGCTGCTG TGAG 

Bases numbered 1-24 corresponding to positions 34,243- 
34,266 in GENBANK Entry HUMADAG . 

OLIGONUCLEOTIDE o7-2 

10 20 
5' AGATCCGGCA ACTTGTAGTA CCCAGGATG 

Bases numbered 7-29 corresponding to positions 35.618- 
35.596 in GENBANK Entry HUMADAG. Bases 1-6 corresponding 
• to one of the four possible recognition sequences for the 
restriction enzyme BstYI, added to facilitate cloning. 

EXAMPLE_yiI 2Hantification_of_Effect_of 

Chromosomal_Deletions_of_Homolo£ous 
liaHences_Present_in_Host_Cell 
Orr-Weaver et_al^ (f£oc^ Natl^ Acad^ Sci^ USA, Vol. 
78. 10:6354-6358. October 1981) showed that a plalmid 
carrying the yeast LEU2 gene results in leu"" transfor- 
mants at a frequency of 1.4-1.7.per ^g of DNA when a 
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double -strand break was made in the pBR322 portion of the 
plasmid- This is 1/10 of the frequency at which leu"*" 
transf ormants arose when targeting was directed to the 
LEU2 gene by a double-strand break in LEU2 sequences 
(12-17 per pg DNA) . Similarly, when a HIS3 containing 
plasmid was cut within pBR322 sequences, his'*' trans- 
formants appeared at 1/60 of the rate observed when the 
same plasmid was cut within HISS. In both cases, the 
non-targeted prototrophs were demonstrated to be the 
results of recombination between the plasmid and the 
chromosomal leu2 and hisS" mutant genes. Thus, 
screening a library for one clone out of 50,000 by 
homologous recombination without deletion' of the chromo- 
somal LEU2 gene would be expected to yield 5,000 leu"*" 
transf ormants which arise through homologous recombi- 
nation with the yeast genome when the targeting plasmid 
carries LEU2 , even if a double - s trand targeting break is 
made in another part of the plasmid. The results 
suggest, however, that deleting the chromosomal copies of 
LEU2 and HISS would eliminate virtually all of the non- 
targeted events. 

The advantage of chromosomal deletions from host 
cells for the purposes of the method was quantified as 
follows: A plasmid carrying the yeast ARG4 ("target") 
and URA3 ("marker") genes was transformed into a mixture 
of yeast cells after making a double-strand break at the 
unique Bell site in the ARG4 sequence". All of the cells 
in the mixture had homology to URA3 , but only 1 in 1,000 
or 1 in 10,000 had homology to ARG4. This type of 
dilution experiment measures the relative frequencies of 
targeted and non- targeted events. For example, using 1" 
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pg of DNA and a 1 to 1,000 dilution, the isolation of 5 
yeast colonies by homologous recombination at ARG4 
indicates that 5,000 cells were theoretically capable of 
a targeted event, but only 5/5,000 cells actually had the 
necessary homology at ARG4 . The targeting frequency is 
5 - therefore equivalent to 5,000 targeted events per pg in 
an undiluted culture. If. in the same experiment, 5 
colonies were isolated that were independent o.f homology 
at ARG4 (recombination at URA3 or elsewhere, non-targeted 
events), the frequency of these non-targeted events is 5 
10 per /ig. and the ratio of targeted to non-targeted events 
in this experiment would be 1,000 to 1. 

For the 1 in 1,000 dilution, 78 targeted trans- 
formants were isolated (by recombination with ARG4 ; 
equivalent to 78,000 targeted events) and 17 by recombi- 
15 nation elsewhere (non- targeted events). At a dilution.of' 
1 in 10.000, four targeted events (equivalent to 40,000 
targeted events) and seven non-targeted events were 
isolated. The ratio of targeted to non-targeted events 
is thus (78.000 + 40,000) divided by (17 + 7). or 4,917 
20 to 1. This ratio would lead to approximately 10 in- 
correct events for every one correct event when screening 
a library for a sequence present on 1 in 50,000 YACs. 
which is several-fold too high to be generally accept- 
able, although the use of URA3 as a targeting marker is 
25 clearly preferred over the use of the LEU2 or HIS3 

markers previously used in targeting studies (Orr.-.Weaver. . 

et al., 1981). 84% (16 of 19 analyzed) of the non- 
targeted events where in fact due to recombination 
between the URA3 marker on the plasmid and the chromo- 
30 somal uraS" locus. If there were no homology between the 
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rargeting plasmid and the chromosomal ura3 locus, then 
the non-targeted events resulting from homology at the 
ura3 locus are removed from the analysis and the ratio 
increases to 30,729 to 1. At this ratio, a sequence 

5 represented 3 times in 50,000 YACs would be correctly 
targeted 1.8 times for every one non- targeted event. 
This ratio would also result in the favorable ratio of 
one correct event for every 1.6 incorrect events when 
screening a library for a sequence present on only 1 in 

10 50,000 YACs. 

These results indicate that the selection of a 
targeted clone from a DNA YAC library is feasible and 
particularly efficient in host yeast cells that carry no 
homology with selectable markers present on targeting 
X5 vectors. 
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CLAIMS 

A method of producing a physical map of contiguous 

DNA sequences, comprising the steps of: 

a) providing a first DNA fragment library and a 

second DNA fragment library, wherein the first 
library and the second library are each in a 
population of eukaryotic host cells in which 
genetic recombination between DNA introduced 
into the eukaryotic host cells occurs by 
homologous recombination; DNA fragments are 
present in eukaryotic host cells in an episome 
which is replicatable in the eukaryotic host 
cells and additionally comprises: 1) sequences 
necessary for propagation in bacteria, 2) two 
different marker genes for selection in 
bacteria, and 3) two different marker genes for 
selection in the eukaryotic host cells; and the 
selectable markers in the first library are not 
the same as the selectable markers in the 
second library; 

>) introducing into one DNA fragment library a 

targeting DNA vector which is non-replicating 
in the eukaryotic host cells, the targeting 
vector comprising a marker gene for selection 
in the eukaryotic host cells and targeting DNA 

„. .which is- homoLogous in part to a target DNS 

fragment, thereby producing a mixed population 
of eukaryotic host cells; 
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c) maintaining the product of step (b) under 

conditions appropriate for homologous recombi- 
nation to occur, whereby the eukaryotic host 
cells containing the target DNA fragment are 

5 stably transformed with the marker gene for 

selection in eukaryotic host cells as a result 
of homologous recombination between a target 
DNA fragment and the targeting DNA sequence and 
stably transformed eukaryotic host cells, with 

10 a selectable phenotype and containing the 

target DNA fragment, are produced; 

d) selecting stably transformed eukaryotic host 
cells by culturing the product of the previous 
step under conditions appropriate for selection 

15 of stably transformed eukaryotic host cells; 

e) digesting total DNA from stably transformed 
eukaryotic host cells with a restriction 
enzyme, thereby releasing from episomes an 
episome region which includes a target DNA 

20 fragment terminus, a marker gene for selection 

in bacteria; a marker gene for selection in 
eukaryotic host cells and sequences necessary 
for propagation in bacteria, thereby isolating 
an episome region; 

25 f) circularizing the episome region produced in 

the previous step, thereby producing a cir- 
cularized DNA" molecule which is referred to as 
a subsequent targeting vector and comprises the 
episome region produced in step (e) , wherein 

30 the target DNA fragment derived from the 

episome region is referred to as subsequent 
targeting DNAj 
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selecting and amplifying the subsequent tar- 
geting vector in bacteria; 

introducing into the other DNA fragment library 
the subsequent targeting vector, thereby 
producing a mixed population of host eukaryotic 
cells ; 

maintaining the product of step (h) under 
conditions appropriate for homologous recombi- 
nation to occur, whereby eukaryotic host cells 
containing the target DNA fragment are stably 
transformed with the marker gene for selection 
in eukaryotic host cells and the targeting DNA 
present in the subsequent targeting vector as a 
result of homologous recombination between a 
target DNA fragment and the targeting DNA 
present in the subsequent targeting vector, 
thereby producing stably transformed eukaryotic 
host cells with a selectable phenotype which 
contain the target DNA fragment that is homo- 
logous to the targeting DNA fragment present 
in the subsequent targeting vector; 
selecting stably transformed eukaryotic host 
cells by culturing the product of the previous 
step under conditions appropriate for selection 
of stably transformed eukaryotic host cells; 
repeating steps (e) through ( j ) as needed; 
constructing a physical map by ordering target 
DNA fragments derived from the episome region 
obtained in step (k) . 
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2. A method of Claim 1 wherein the targeting DNA 
molecule is a bacterial plasmid. 

3. A method of Claim 2 wherein the bacterial plasmid 
has a double -strand break introduced within the 
targeting DNA sequence. 

4. A method of Claim 1 wherein the target DNA fragment 
is cDNA. 

5. A method of Claim 2 wherein the episomes present in 
eukaryotic host cells are artificial chromosomes and 
the artificial chromosomes additionally comprise all 
of the DNA sequences necessary for the artificial 
chromosome to participate in host cell replication 
and chromosome segregation. 

6. A method of Claim 3 wherein the episomes present in 
eukaryotic host cells are artificial chromosomes and 
the artificial chromosomes additionally comprise all 
of the DNA sequences necessary for the artificial 
chromosome to participate in host cell replication 
and chromosome segregation. 

7. A method of Claim 1 wherein the target DNA fragment 
is selected from the group consisting of: mammalian 
DNA sequences; human DNA sequences; plant DNA 
sequences; mammalian genes; human genes; and plant 
genes . 
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A method of Claim 3 wherein the target DNA fragment 
is selected from the group consisting of: mammalian 
DNA sequences; human DNA sequences; plant DNA 
sequences; mammalian genes; human genes; and plant 
genes . 

A method of Claim 5 wherein the selectable marker 
gene is selected from the group consisting of genes 
which confer a selectable phenotype on eukaryotic 
host cells and the selectable phenotype is selected 
from the group consisting of: antibiotic resist- 
ance, nutrient prototrophy, tolerance to a metal 
ion. ability to progress through the cell cycle and 
expression of a cell surface marker. 

A method of Claim 6 wherein the selectable marker 
gene is selected from the group consisting of genes 
which confer a selectable phenotype on eukaryotic 
host cells and the selectable phenotype is selected 
from the group consisting of: antibiotic resist- 
ance, nutrient prototrophy. tolerance to a metal 
ion, ability to progress through the cell cycle and 
expression of a cell surface marker. 

11. A DNA sequence, isolated by the method of Claim 7, 
selected from the group consisting of: mammalian 

DNA sequences; human DNA sequences; plant DNA " 

sequences; mammalian genes; human genes; and plant 
genes . 



10 

15 
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12, A DNA sequence, isolated by the method of Claim 8, 
selected from the group consisting of mammalian DNA 
sequences; human DNA sequences; plant DNA sequences; 
mammalian genes; human genes; and plant genes. 



A method of producing a physical map of contiguous 

DNA sequences, comprising the steps of: 

a) providing a first DNA fragment library and a 

second DNA fragment library, wherein the first 
library and the second library are each in a 
population of yeast cells in which genetic 
recombination between DNA introduced into the 
yeast host cells occurs by homologous recombi- 
nation; DNA fragments are present in the yeast 
cells in an episome which is replicatable in 
the yeast cells and additionally comprises: 1) 
sequences necessary for propagation in 
bacteria, 2) two different marker genes for 
selection in bacteria, and 3) two different 
marker genes for selection in the yeast cells; 
and the selectable markers in the first library 
are not the same as the selectable markers in 
the second library; 
b) introducing into one DNA fragment library a 

targeting DNA vector which is non-replicating 
in the yeast cells, the targeting vector 
comprising a marker gene for selection in the 
yeast cells and targeting DNA which is homo- 
logous in part to a target DNA fragment derived 
from the episome region, thereby producing a 
mixed population of host cells; 
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maintaining the product of step (b) under 
conditions appropriate for homologous recombi- 
nation to occur, whereby yeast cells con- 
taining the target DNA fragment are stably 
transformed with the marker gene for selection 
in yeast cells and targeting DNA present in the 
targeting vector as a result of homologous 
recombination between a target DNA fragment and 
the targeting DNA sequence and stably trans- 
formed yeast cells, with a selectable phenotype 
and containing the target DNA fragment are 
produced; 

selecting stably transformed yeast cells by 
culturing the product of the previous step 
under conditions appropriate for selection of 
stably transformed yeast cells; 
digesting total DNA from stably transformed 
yeast cells with a restriction enzyme, thereby 
releasing from episomes an episome region which 
includes a target DNA fragment terminus, a 
marker gene for selection in bacteria; a marker 
gene for selection in yeast cells and sequences 
necessary for propagation in bacteria, thereby 
isolating an episome region; 

circularizing the episome region produced in 
the previous step, thereby producing a 
circularized DNA molecule which is referred to 
as a subsequent targeting vector and comprises 
the episome region produced in step (e), 
wherein the target DNA fragment derived from 
the episome region is referred to as subsequent 
targeting DNA; 
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g) selecting and amplifying the subsequent tar- 
geting vector in bacteria; 

h) introducing into the other DNA fragment library 
the subsequent targeting vector, thereby 
producing a mixed population of yeast cells; 

i) maintaining the product of step (h) under 
conditions appropriate for homologous recombi- 
nation to occur, whereby yeast cells containing 
the target DNA fragment are stably transformed 
with the marker gene for selection in yeast 
cells and the targeting DNA present in the 
subsequent targeting vector as a result of 
homologous recombination between a target DNA 
fragment and the targeting DNA that is homo- 
logous to the targeting DNA fragment present in 
the subsequent targeting vector, thereby 
producing stably transformed yeast cells with a 
selectable phentoype and which contain the 
target DNA fragment present in the subsequent 

20 targeting vector; 

j) selecting stably transformed yeast cells by 

culturing the product of the previous step 

under conditions appropriate for selection of 

stably transformed yeast cells; 
^> repeating steps (e) through ( j ) as needed to 

isolate target DNA fragments; and 
1) constructing a physical map by ordering target 

DNA fragments isolated in step (k) . 
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A method of Claim 13 wherein the targeting DNA 
vector is a bacterial plasmid. 



15. A method of Claim 14 wherein the bacterial plasmid 
has a double-strand break introduced within the 

5 targeting DNA sequence. 

16. A method of Claim 14 wherein the episomes present in 
eukaryotic host cells are artificial chromosomes and 
the artificial chromosomes additionally comprise all 
of the DNA sequences necessary for the artificial 
chromosome to participate in host cell replication 
and chromosome segregation. 



10 
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A method of Claim 15 wherein the episomes present in 
eukaryotic host cells are artificial chromosomes and 
the artificial chromosomes additionally comprise all 
of the DNA sequences necessary for the artificial 
chromosome to participate in host cell replication 
and chromosome segregation. 

A method of Claim 13 wherein the target DNA fragment 
is cDNA. 

A method of Claim 13 wherein the target DNA fragment 
is selected from the group consisting of: mammalian 
D.NA .sequences; human DNA sequences ; plant DNA 
sequences; mammalian genes; human genes; and plant 
genes . 
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20. A method of Claim 15 wherein the target DNA fragment 

is selected from the group consisting of: mammalian 

DNA sequences; human DNA sequences; plant DNA 

sequences; mammalian genes; human genes; and plant 
genes . 



21- A method of Claim 16 wherein the selectable marker 
gene is selected from the group consisting of genes 
which confer a selectable phenotype on yeast cells 
and the selectable phenotype is selected from the 
10 group consisting of: antibiotic resistance, 

nutrient prototrophy, tolerance to a* metal ion, 
ability to progress through the cell cycle and ex- 
pression of a cell surface marker. 



22, A method of Claim 17 wherein the selectable marker 
15 gene is selected from the group consisting of genes 

which confer a selectable phenotype on yeast cells 
and the selectable phenotype is selected from the 
group consisting of: antibiotic resistance, 
nutrient prototrophy, tolerance to a metal ion, 
20 ability to progress through the cell cycle and ex- 

pression of a cell surface marker. 



23. A DNA sequence, isolated by the method of Claim 19, 
selected from the group consisting of: mammalian 
DNA sequences; human DNA sequences; plant DNA 
25 sequences; mammalian genes; human genes; and plant 

genes . 
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A DNA sequence, isolated by the method of Claim 20, 
selected from the group consisting of: mammalian 
DNA sequences; human DNA sequences; plant DNA 
sequences; mammalian genes; human genes; and plant 
genes . 



A method of Claim 13 wherein in step (b) , the 
selectable marker gene for selection in yeast linked 
to targeting DNA is present in a replicating plasmid 
when introduced into the yeast cells and • the 
selectable marker gene and the targeting DNA are 
subsequently released from the replicating plasmid 
as a non-replicating molecule. 

A method of fragmenting human genomic DNA suitable 
for incorporation in a recombinant -DNA library which 
is to be used for mapping contiguous genomic DNA 
fragments, comprising digesting human genomic DNA 
with at least one restriction endonuclease selected 
from the group consisting of: Apal, Nsil and Seal, 
thereby selecting against the occurrence of certain 
repetitive DNA sequences at the termini of the DNA 
fragments produced. 

Saccharomxces cerevisiae carrying a chromosomal 
deletion in four selectable marker genes. 

Saccharomyces cerevisiae of Claim 27 wherein the 
four selectable genes are ARG4 . TRPl, LEU2 and URA3 . 
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29. A YAC arm vector comprising: 

a) a yeast selectable marker gene; 

b) a bacterial origin of replication; 

c) a bacterial selectable marker gene; and 

d) a yeast telomere. 



30. The YAC arm vector of Claim 29 additionally com- 
prising a yeast replication origin, yeast centromere 
sequences or both. 



31. A YAC arm vector selected from the group consisting 
10 of: 

a) pTKENDA; 

b) PTKENDA2; 

c) pTKENDB; 

d) pTKENDC; and 
15 e) pTKENDD. 

32. A yeast artificial chromosome comprising: 

a) two DNA sequences for replication in bacterial; 

b) two selectable marker genes for selection in 
bacteria; 

20 c) two yeast telomere sequences; 

d) a yeast centromere; 

e) two selectable marker genes for selection in 
yeast; and 

f) one or more yeast origins or replication. 
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33. A DNA fragment library comprising a yeast strain 

carrying a chromosomal deletion of four selectable 
marker genes, the yeast strain having incorporated 
therein a pair of YAC arm vectors, the members of 
the pair referred to as a first YAC arm vector and a 
second YAC arm vector, each YAC arm vector com- 
prising : 

a) a yeast selectable marker gene which is one of 
the four selectable marker genes for which the 
yeast strain carries a chromosomal deletion; 

b) a bacterial origin of replication; 

c) a bacterial selectable marker gene; and 

d) a yeast telomere, 

wherein a) one member of the pair of YAC arm vectors 
15 includes a DNA sequence which functions as a centro- 

mere in yeast; b) one or both of the YAC arm vectors 
includes an origin of replication functional in 
yeast; c) the first YAC arm vector and the second 
YAC arm vector are each ligated to a DNA fragment of 
20 non-yeast origin; and d) the first YAC arm vector 

and the second YAC arm vector each comprise a yeast 
selectable marker gene not present in the other 
member of the pair. 



25 



34. The DNA fragment library of Claim 33 wherein the 

yeast strain carries a chromosomal deletion in the 
ARG4, TRPl, LEU2 and URA3 genes. 



35. 



The DNA fragment library of Claim 34 wherein the DNA 
fragment of non-yeast origin is a mammalian DNA 
fragment or a plant DNA fragment. 
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36. The DNA fragment library of Claim 35 wherein the 
mammalian fragment is a human DNA fragment. 

37. A pair of DNA fragment libraries, comprising: 

a) a yeast strain carrying a chromosomal deletion 
of four selectable marker genes and having 
incorporated therein a pair of YAC arm vectors, 
the members of the pair referred to as a first 
YAC arm vector and a second YAC arm vector, 
each YAC arm vector comprising: 

1) a yeast selectable marker gene which is 
one of the four selectable marker genes 
for which the yeast strain carries a 
chromosomal deletion; 

2) a bacterial origin of replication; 

3) a bacterial selectable marker gene; and 

4) a yeast telomere, 

wherein a) one member of the pair of YAC arm vectors 
includes a DNA sequence which functions as a centro- 
mere in yeast; b) one or both of the YAC arm vectors 
includes an origin of replication functional in 
yeast; and c) the first YAC arm vector and the 
second YAC arm vector are each ligate'd to a DNA 
fragment of non-yeast origin; 

b) the yeast strain of a) having incorporated 

therein a second pair of YAC arm vectors, the 
members of the pair referred to as a third YAC 
arm vector and a fourth YAC arm vector, each 
arm vector comprising: 
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1) a yeast selectable marker gene which is 
one of the four selectable marker genes 
for which the yeast strain carries a 
chromosomal deletion; 

2) a bacterial origin of replication; 

3) a bacterial selectable marker gene; and 

4) a yeast telomere, 

wherein a) one member of the second pair of YAC arm 
vectors includes a DNA sequence which functions as a 
centromere in yeast; b) one or both of the YAC arm 
vectors includes an origin of replication functional 
in yeast; and c) the third YAC arm vector and the 
fourth YAC arm vector are each ligated to a DNA 
fragment of non-yeast origin; and wherein the first, 
second, third and fourth YAC arm vectors each 
comprise a yeast selectable marker gene not present 
in the other YAC arm vectors. 



A DNA fragment library comprising a yeast strain 
carrying a chromosomal deletion of the ARG4 gene, 
the TRPl gene, the LEU2 gene and the URA3 gene, the 
yeast strain having incorporated therein a pair of 
YAC arm vectors, each member of the pair ligated to 
a DNA fragment of non-yeast origin and the pair 
selected from the group consisting of: 
25 a) pTKENDA and pTKENDB; 

. - b) pTKENDA2 and pTKENDB; 

c) pTKENDA and pTKENDC; 

d) pTKENDA2 and pTKENDC ; 

e) pTKENDC and pTKENDD; and 
30 f) pTKENDB and pTKENDD. 
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39. The DNA fragment library of Claim 38 wherein the DNA 
fragment of non-yeast origin is a mammalian DNA 
fragment or a plant DNA fragment, 

40. The DNA fragment library of Claim 39 wherein the DNA 
fragment of mammalian origin is a human DNA 
fragment . 
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Oligonucleotide 

12 GGTCTCTACA GGTTCTGACA TTATT 

13 CCGGCGTAGA GAATCCACAG GACGG 

14 CTCCTGATGA CGCATGGTTA CTC 

15 GGAAAGAAAT GCACAAGCTT TTGCC 

16 CCGATACCAG GACCTTGCCA TCC 



FIGURE 8c 



SUBSTITUTE SHEET 



wo 93/03183 



14/17 



PCr/US91/08679 



1234567 89 






FIG. 9 



SUBSTITUTE SHEET 



wo 93/03183 



15/17 



PCr/US91/08679 




FIG. 10 



suBsrrruTE sheet 



wo 93/03183 



1 6 / t 7 



PCr/US91/08679 



1 2 3 4 5 6 




FIG. II 



SUBSTITUTE SHEET 



wo 93/03183 



17/17 



PCr/US91/08679 



U 

c> 
o 



u 

o 
5 



ro 
tt) 

3 
U 

o 



U 

-S? 
o 



II It 

0) c ^ 5i 
CO o o 5 




CD 
Ll 



SUBSTITUTE SHEET 



Inltmitional AppUcation No: PCT/ US91 08679 



MICROORGANISMS 

Oyllml ShMl In eonnactlon with the mlcreerotnlMi r«««rr*d t0 en (»o«__L5__ 



1 Q_0 1 

, to» /-^ of ttM dMCJtpden » 



. lOIMTinCATWII OF DIFOSIT * 
FwthK tf*po»ni sf« Wwittftad on tn addmonal thMl Qa 



American Type Culture Collection 



AMr»M •« 49pottuf iMimitlon (Inchidlno po«tal cod* M countiy) ♦ 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



D«t* of d*po«n • 

28 June 1991 



A cc — Number « 

#s 74010, 40832 and 40833 



ADOmOMAL INDICATIONS » (Imv* bUnh II not applleabU). ThU tnfofmatlon U contintMd on « ooponta ottachpd thaot^Q 



In respect of those designations in which a European 
Patent is sought, the Applicant hereby informs the 
European Patent Office under European Rule 28(4) that, 
until the publication of the mention of the grant of the 
European Patent or until the date on which the European 
Application has been refused or is withdrawn or is deemed 
to be withdrawn, the availability of the biological 
material deposited with the i^eri^^ Type Culture Collec- 
tion under Accession Nob. ASa^^OBSr^^ (Cont'd Appendix A) 



C. DCSIONATtO OTATtS fon WNICH INDICATIONS ARB MADI • (11 th* IndlcaUona not for aU daolQAatad Stataa) 



EP, AU, CA, JP, KR 



O. SI^ARATt niRNISHINS OF INDICATtONS • (laaw Wank M not appUttbIa) 



- AcSJiSw^SSmSlT'of O^'^^^ •ubmlttad to tbo tmamatlonal Buraau latar • (Spocify tha panaral natuf. ol Ibo Indtoattena a.g. 



I* □ Thia ahaal waa racatvad witb tha Intarnationai applicailen whan filad (lo ba ehackad by tha racafvmo Offica) 



(Aulhorisod OAcar) 
g Tha data of racwpt (from tha appUcant) by tha IntamationaJ Suraau 



:i S OCT 1992 
(16.ia92^ 



h 



(AutAeibad Oflleaf) 



rofw PCT/H0/1J4 (January iMl) 



(Jaouiry 1991) 



APPENDIX A 



Box B Continued 



shall be effected only by the issue of a sample to an expert 
nominated by the requester in accordance with European Rule 
28(5). In accordance with Regulation 3.25 of the Patents 
Regulations (Australia Statutory Rules 1991 No. 71) , samples 
of materials deposited in accordance with the Budapest 
Treaty in relation to this Patent Request are only to be 
provided before: The patent is granted on the application; 
or the application has lapsed or been withdrawn or refused; 
to a person who is: a skilled addressee without an interest 
in the invention; and nominated by a person who makes a 
request for the furnishing of those samples- 





INTERNATIONAL SEARCH REPORT 


PCT/US 91/08679 




iDteniatto&al AppUcatioo No 


I. CLASSmCATION OF SUBJECT MATTER scvcni dassificatioD symbols apply, indicate alt)^ 


AccorrftDg to IntemationaJ Patent Oasaficatioo (IPQ or to both NationaJ OassxAcatioa and IPC 




Int. CI. 5 C12Q1/68; C12N15/81 




U. FIELDS SEARCHED 


Minifflum Docnmcotatioo Searched' 


Ctassificatiott System 


Qassificatioo Symbols 


Int.Cl. 5 


C12Q 





Documentation Searched other than Minimuo Docnmcntatioo 
to the Extent that such Documents are Included in the Fields Searched' 



m. DOCUMENTS CONSIDERED TO BE RELEVANT^ 



Category* 



Citation of Document, with indication, where appropriate, of (he relevant passages 



Relevant to Qalm NoP 



GENE. 

vol. 106, June 1991, AMSTERDAM NL 
pages 97 - 101; 

G. D. STEWART ET AL: 'Plasralds for recombination 
based screening' 

GENE. 

vol. 85, 1989, AMSTERDAM NL 
pages 35 - 43; 

S. KURACHI ET AL: 'Suroo 15A: a lambda phasraid 
that permits easy selection for and against 
cloned inserts* 



Special categories of dtcd documents : 

'A* document defining the general state of the art which is not 
considered to be oif particular relevance 

'E* earlier document but published on or after the Internationa] 
filing date 

*L' document which may throw doubts on priority claim(s) or 
which is dted to establish the publicauon date of another 
citation or other special reason (as specified) 

'O' document referring to an oral disclosure, us^ exhibition or 
other means 

n** document published prior to the tntcmattonal filing date but 
later than the priority date dalraed 



later document published after the international filing date 
or priority date and not in conflict with the application but 
dted to understand the prindple or theory underiying the 
invention 

document of particubr relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step 

document of particular rdevance; the daimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination bdng obvious to a person skilled 
in the art. : . . . ^ 

document member of the same patent^ family - 



IV. ceruhcation 



Date of the Actual Completion of the International Search 

21 APRIL 1992 


Date of Mailing of this International Search Report 

1 2. 05. 92 


International Seardiing Authority 

EUROPEAN PATENT OFHCE 


Signature of Authorized Officer jj 

MOLINA 6ALAN E^.^^^ ^-t^^ 



iludMi«niy 1915) 



PCT/US 91/08679 



lotcmtioDal AppUation No 



HLDOOJ 


MENTS CONSIDEHED TO BE HELEVANT (CONTOrtJfa) FROM THE SECOND SHtti; 
atitf on of Document, with Indication, where appropiitte, of the idevint passages 




A 


NATURE. 

vol. 305, 15 September 1983, LONDON GB 
pages 189 - 193; ^ . , 
A. W. MURRAY ET AL: 'Constructions of artificial 
* chromosomes In yeast* 
cited In the application 





Farm PCT/XSA/UO (otn iM) (Jmmaf 1115) / 



